(57) Abstract: The present invention relates generally to a method of genetic fingerprinting and more particularly to a method of
establishing a methylation signature for a particular eukaryotic cell or group of cells such as in the form of tissue cells or organ- 
specific cells. The method of the present invention exploits the sensitivity of certain restriction enzymes to methylation. Exposure of 
genomic or transgene DNA to these enzymes followed by amplification results in products which establish the methylation signature 
or profile (i.e. the methylome profile) of a cell's or group of cells' genome when compared to the amplified genome in undigested
form. The method of the present invention is useful inter alia in phenotyping a cell based on its methylation signature and provides a 
useful tool in functional genomics and for the design of therapeutic and trait-modifying protocols, particularly for animals and plants. 
The present invention can also be used to identify and map junctions between methylated and unmethylated DNA. The method of 
the present invention is also useful for identifying DNA methylation polymorphisms which can be used inter alia in diagnosis and 
forensics and for identifying particular genes, the function or absence of function of which are associated with a disease condition 
or trait. The method is also useful for monitoring the aging process of particular cells or an animal (including a human) or plant 
comprising such cells as well as monitoring the pluripotent or multipotent state of stem cells and development of stem cells through 
to maturation. 
FIELD OF THE INVENTION

The present invention relates generally to a method of genetic fingerprinting and more
particularly to a method of establishing a methylation signature for a particular eukaryotic
cell or group of cells such as in the form of tissue cells or organ-specific cells. The method
of the present invention exploits the sensitivity of certain restriction enzymes to
methylation. Exposure of genomic or transgene DNA to these enzymes followed by
amplification results in products which establish the methylation signature or profile (i.e.
the methylome profile) of a cell's or group of cells' genome when compared to the
amplified genome in undigested form. The method of the present invention is useful inter
alia in phenotyping a cell based on its methylation signature and provides a useful tool in
functional genomics and for the design of therapeutic and trait-modifying protocols,
particularly for animals and plants. The present invention can also be used to identify and
map junctions between methylated and unmethylated DNA. The method of the present
invention is also useful for identifying DNA methylation polymorphisms which can be
used inter alia in diagnosis and forensics and for identifying particular genes, the function
or absence of function of which are associated with a disease condition or trait. The
method is also useful for monitoring the aging process of particular cells or an animal
(including a human) or plant comprising such cells as well as monitoring the pluripotent or
multipotent state of stem cells and development of stem cells through to maturation.

BACKGROUND OF THE INVENTION 

Bibliographic details of references provided in the subject specification are listed at the end 
of the specification. 

Reference to any prior art in this specification is not, and should not be taken as, an
acknowledgment or any form of suggestion that this prior art forms part of the common
general knowledge in any country.

The Human Genome Sequencing Project has provided an invaluable and near-comprehensive
database of human genes. The challenge now is to understand how the
genome is expressed in a certain fashion and in particular cell types to produce the traits
that ultimately determine the nature and quality of life. In recent years, it has become
increasingly apparent that DNA methylation plays a key role in the regulation of gene
expression in higher organisms. DNA methylation (mainly methylation of cytosines in
CpG sites) is thought to direct chromatin compaction and generally the switching off of
genes. The importance of DNA methylation has been highlighted by its involvement in
several human diseases, including cancer, and the list of such diseases is expected to
expand rapidly in the post-genomics era.

Defining the genomic methylation signatures (or "methylomes") of healthy and diseased
cells could lead to the understanding, early diagnosis and treatment of many human
diseases.

The evolutionary roles of DNA methylation in eukaryotes have been suggested to include
host defence against transposons and retrotransposons (O'Neill et al., Nature 393: 68-72,
1998) and developmental and tissue-specific gene regulation (Walsh and Bestor, Genes
Dev. 13: 26-34, 1999). The role of DNA methylation in developmental gene control has
been the subject of considerable controversy (Yoder et al., Trends Genet. 13: 335-340,
1997). A role for DNA methylation in human disease has also been widely suggested but
difficult to demonstrate unequivocally. Nevertheless, there is now compelling recent
evidence of its involvement in several human genetic diseases, including Rett Syndrome,
ICF and Fragile X (Hendrich, Current Biology 10: 60-63, 2000). These diseases have been
found to be linked to mutations at cytosine nucleotides, or to mutations in the genes
encoding proteins that are involved in DNA methylation (Hendrich, 2000, supra). DNA
methylation has also been implicated in carcinogenesis. Large scale genome-wide
hypomethylation was detected in many tumor cells compared with their normal control
cells (Laird and Jaenisch, Annu. Rev. Genet. 30: 441-464, 1996). This deficiency in
methylation is thought to cause genetic instability and subsequent tumor progression
(Lengauer et al., Proc. Natl. Acad. Sci. USA 94: 2545-2550, 1997). Conversely, in a
number of cases, silenced tumor suppressor genes have been correlated with methylation
of the promoter (Baylin and Herman, Trends Genet. 16: 168-174, 2000; Herman, Semin.
Cancer Biol. 9: 359-367, 1999).

One of the problems limiting this important field of research has been the lack of efficient 
procedures for large scale assessment of DNA methylation at CpG sites around the 
genome. 

SUMMARY OF THE INVENTION

Throughout this specification, unless the context requires otherwise, the word "comprise",
or variations such as "comprises" or "comprising", will be understood to imply the
inclusion of a stated element or integer or group of elements or integers but not the
exclusion of any other element or integer or group of elements or integers.

Nucleotide and amino acid sequences are referred to by a sequence identifier number (SEQ
ID NO:). The SEQ ID NOs: correspond numerically to the sequence identifiers <400>1
(SEQ ID NO:1), <400>2 (SEQ ID NO:2), etc.

In accordance with the present invention, an amplification-based assay procedure has been
developed to determine the methylation profile of nucleotides in the genome of a cell or
group of cells. More particularly, the nucleotides are in the form of CpG or CpNpG sites.
The ability to determine genomic and transgene methylomes in a cell or group of cells is
an important tool in functional genomics and in developing the next generation of
gene-expression modulating agents. Combining methylation profile with mapping enables a
determination of the epigenetic consequences of internal and external stimuli. For example,
methylation profiles may correlate with disease conditions or a propensity for a disease
condition to develop, or may be used to monitor the aging process or the development
process of cells. Furthermore, the methylation profile can be used to determine genes which
either are expressed or are not expressed in certain disease states or with certain phenotypic
traits. The identification of a condition or predisposition for development of a condition
leads to the selection of therapeutic agents or protocols for the treatment of same.

The present invention is predicated in part on an amplification-based technology referred
to herein as amplified methylation polymorphisms (AMP). The AMP technology
determines the methylation profile of many thousands of CpG or CpNpG sites around the
genome and provides a genetic profile of the methylation status of these sites. This genetic
signature is the methylome fingerprint of a cell's or group of cells' genome. The present
invention extends, however, to determination of methylation of sites other than CpG or
CpNpG sites.

The AMP technology involves amplification of DNA markers in the form of small inverted
repeats comprising the CpG or CpNpG sites but where amplification depends on the
methylation status of the cytosines within the amplicon or nearby.

In one particularly useful embodiment, the protocol uses a single arbitrary decamer
oligonucleotide primer containing the recognition sequences of a methylation-sensitive
restriction enzyme. These short oligonucleotide primers containing such recognition
sequences are referred to herein as AMP primers. The recognition sequences for the
methylation-sensitive restriction enzyme are located in the middle of the primer followed
by up to four selective nucleotides, extending to the 3' end. AMP profiles are generated
from both undigested genomic DNA and genomic DNA digested with the
methylation-sensitive enzyme. Comparison of the profiles from digested and undigested
genomic DNA reveals three classes of AMP markers: digestion resistant (Class I) indicative of
methylation, digestion sensitive (Class II) indicative of non-methylation, and digestion
dependent (Class III). The nature of the last class of AMP markers is proposed to represent
physically-linked cis-acting inhibitory sequences which suppress amplification of Class III
markers from undigested template. Digestion with the enzyme removes the inhibitor from
the amplicon, thereby allowing amplification. The digestion-dependent (Class III) markers
are proposed to encompass a methylated restriction site or sites in the amplicon sequence
flanked by a non-methylated restriction site and then the putative inhibitory sequence.
Digestion-dependent markers represent, therefore, junctions between methylated and
non-methylated DNA in the genome. Cloning, sequencing and mapping AMP markers shows
that they often correspond to CpG islands, features known to be landmarks for genes in
genomes. Southern analysis has confirmed the utility of AMP for detecting DNA
methylation polymorphisms between genomes and that Class III AMPs represent the
junction between methylated and unmethylated DNA in genomes.
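
By way of illustration only, the comparison of digested and undigested profiles described above can be sketched as a simple lookup on the presence or absence of a band in each lane. This sketch is not part of the specification: the function names, the enum, and the representation of a profile as a set of band sizes in base pairs are assumptions made for the example.

```python
from enum import Enum


class MarkerClass(Enum):
    """The three AMP marker classes named in the text."""
    DIGESTION_RESISTANT = "Class I"    # band in both lanes: methylated
    DIGESTION_SENSITIVE = "Class II"   # band in undigested lane only: non-methylated
    DIGESTION_DEPENDENT = "Class III"  # band in digested lane only: junction


def classify_marker(in_undigested, in_digested):
    """Classify one band from its presence in the undigested and digested lanes."""
    if in_undigested and in_digested:
        return MarkerClass.DIGESTION_RESISTANT
    if in_undigested:
        return MarkerClass.DIGESTION_SENSITIVE
    if in_digested:
        return MarkerClass.DIGESTION_DEPENDENT
    return None  # band absent from both lanes: nothing to classify


def classify_profile(undigested_bands, digested_bands):
    """Classify every band seen in either lane.

    Each argument is a set of band sizes (hypothetical units: base pairs).
    """
    all_bands = set(undigested_bands) | set(digested_bands)
    return {
        size: classify_marker(size in undigested_bands, size in digested_bands)
        for size in sorted(all_bands)
    }
```

For example, `classify_profile({300, 450}, {300, 620})` would assign the 300 band to Class I, the 450 band to Class II and the 620 band to Class III. Real profiles would also require a rule for matching bands of nearly equal size between lanes, which is omitted here.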

The methylome profile of a cell's genome, therefore, is provided by the determination of
AMP markers. It is proposed that changes in methylation profile indicate, determine or
cause changes in patterns of gene expression and important biological phenotypes such as
cancer and aging.

The terms "methylation signature", "methylation profile" and "methylome" are used
interchangeably throughout the specification and all indicate the methylation status of a
cell's or group of cells' genome. They can also be used to determine the expression or
absence of expression of particular genes. The method of the present invention is useful in
determining methylation polymorphisms which can then be linked to particular traits
and/or phenotypic characteristics or stages of aging or development and can also be useful
in forensics to assist in better profiling a victim or perpetrator.

The present invention contemplates, therefore, a method for determining the methylation
profile of one or more nucleotides at one or more sites within the genome of a eukaryotic
cell or group of cells by obtaining a sample of genomic DNA from the cell or group of
cells, digesting a sub-sample of the sample of genomic DNA with a methylation-sensitive
enzyme having a recognition nucleotide sequence comprising a nucleotide capable of
methylation at one or more of the above-mentioned sites, subjecting the digested DNA to
an amplification reaction using primers comprising a nucleotide sequence capable of
annealing to a non-cleaved form of a cleavable nucleotide sequence of the
methylation-sensitive enzyme recognition sequence and subjecting the products of the
amplification reaction to separation or other detection means relative to a control. The
control comprises another sub-sample of the sample of genomic DNA not subjected to
digestion by the methylation-sensitive enzyme but subjected to an amplification reaction
using the same primers as for the digested DNA sample and then subjecting the products of
the amplification reaction to the separation or other detection means wherein the presence of
amplification products in enzyme digested and non-digested samples is indicative of a
methylation-sensitive enzyme-digestion-resistant marker, the absence and presence of
amplification products in enzyme digested and undigested samples, respectively, is
indicative of a methylation-sensitive enzyme-digestion-sensitive marker and the presence
and absence of amplification products in enzyme digested and undigested samples,
respectively, is indicative of a methylation-sensitive enzyme-digestion-dependent marker.

The method of the present invention determines, therefore, a methylation profile
comprising methylated or unmethylated CpG or CpNpG islands characterized as being a
methylation-sensitive enzyme-digestion-resistant marker, a methylation-sensitive
enzyme-digestion-sensitive marker or a methylation-sensitive enzyme-digestion-dependent marker.
A methylation profile is developed showing the presence or absence of particular
methylated CpG or CpNpG islands as well as junction sites between methylated and
unmethylated regions. These are the methylation-sensitive enzyme-digestion-dependent
AMP markers.

Methylation-sensitive enzymes contemplated herein include AatII, AciI, AclI, AgeI, AscI, AvaI,
BamHI, BsaAI, BsaHI, BsiEI, BsiWI, BsrFI, BssHII, BstBI, BstUI, ClaI, EagI, HaeII, HgaI,
HhaI, HinP1I, HpaII, MluI, MspI, NaeI, NarI, NotI, NruI and PmlI. HpaII is particularly
preferred in accordance with the present invention.

Accordingly, another aspect of the present invention provides a method for determining the
methylation profile of one or more CpG or CpNpG nucleotides at one or more sites within
the genome of a eukaryotic cell or group of cells by obtaining a sample of genomic DNA
from the cell or group of cells, digesting a sub-sample of the sample of genomic DNA with
HpaII which has a recognition nucleotide sequence corresponding to or within the sites,
subjecting the digested DNA to an amplification means such as polymerase chain reaction
(PCR) using primers comprising a nucleotide sequence capable of annealing to a
non-cleaved form of a HpaII cleavable nucleotide sequence and subjecting the products of the
PCR to separation or other detection means relative to a control. The control comprises in
this embodiment another sub-sample of the sample of genomic DNA not subjected to
digestion by HpaII but subjected to an amplification reaction using the same primers as for
the digested DNA sample and then subjecting the products of the amplification reaction to
the separation or detection means wherein the presence of PCR products in enzyme
digested and non-digested samples is indicative of a HpaII-digestion-resistant marker (Hr),
the absence and presence of PCR products in enzyme digested and undigested samples,
respectively, is indicative of a HpaII-digestion-sensitive marker (Hs) and the presence and
absence of PCR products in enzyme digested and undigested samples, respectively, is
indicative of a HpaII-digestion-dependent marker (Hd).

Another aspect of the present invention contemplates an assay device in the form of a kit
useful for determining the methylation profile of one or more nucleotides at one or more
sites within the genome of a eukaryotic cell, the kit comprising in compartmental form
multiple compartments each adapted to comprise one or more of buffers, diluents and
enzymes as single or multiple components which are optionally required to be admixed
prior to use, the kit further comprising instructions for use, wherein the method is
conducted by obtaining a sample of genomic DNA from said cell, digesting a sub-sample
of said sample of genomic DNA with a methylation-sensitive enzyme having a recognition
nucleotide sequence comprising a nucleotide capable of being methylated at one or more
of the above-mentioned sites, subjecting the digested DNA to an amplification reaction
using primers comprising a nucleotide sequence capable of annealing to a non-cleaved
form of a cleavable nucleotide sequence of the methylation-sensitive enzyme recognition
sequence and subjecting the products of the amplification reaction to separation or
detection means relative to a control, said control comprising another sub-sample of said
sample of genomic DNA not subjected to digestion by the methylation-sensitive enzyme
but subjected to an amplification reaction using the same primers as for the digested DNA
sample and then subjecting the products of the amplification reaction to the separation or
other detection means wherein the presence of amplification products in enzyme digested
and non-digested samples is indicative of a methylation-sensitive
enzyme-digestion-resistant marker, and wherein the absence and presence of amplification
products in enzyme digested and undigested samples, respectively, is indicative of a
methylation-sensitive enzyme-digestion-sensitive marker and wherein the presence and
absence of amplification products in enzyme digested and undigested samples, respectively, is
indicative of a methylation-sensitive enzyme-digestion-dependent marker.

Preferably, as indicated above, the methylation-sensitive enzyme is HpaII. Conveniently,
the amplification reaction is a PCR.

The present method can also be used to detect a change in a cell's or group of cells'
developmental state or a cell's or group of cells' exposure to an internal or external
stimulus by detecting a change in methylation profile over time. This may be particularly
relevant and useful in monitoring stem cells including stem cells in developmental
transition and including embryonic stem (ES) cells, embryonic germ (EG) cells and mature
or committed cells. It is also particularly useful in monitoring an animal or plant cell in
tissue culture and/or following recombinant procedures. The method is also useful in
forensics to identify potential characteristics of an individual based on the methylation
profile of DNA samples from a victim or perpetrator. The methylation profile further
provides an indicator as to the likely sensitivity of a cell or group of cells to a virus or
transposon.

The method of the present invention is also useful for monitoring the state and/or extent of
methylation of cloned, non-human embryos such as livestock embryos. These embryos can
be checked for unwanted methylation prior to transplantation as well as after
transplantation.

Alternatively, the present invention provides a method for detecting a change in a cell's or 
group of cells' physiological status such as from a normal state to a disease state. 

BRIEF DESCRIPTION OF THE FIGURES

Figure 1 is a diagrammatic representation showing the AMP protocol and the Class I, II
and III markers using as an example, the methylation-sensitive endonuclease restriction
enzyme, HpaII.

Figure 2 is a diagrammatic representation showing a Class III marker detected by the
AMP protocol in one cell type but not another. Cloning, sequencing and Southern analysis
have shown that inhibitory sequences are often associated with or linked to classical CpG
islands.

Figure 3 is a photographic representation showing different classes of AMP markers.
Comparison of profiles amplified with an AMP primer from either HpaII-digested (H) or
undigested (U) genomic DNA. The profiles in the figure were generated from replicate
DNA extractions from blood of a single human being. The figure demonstrates the
reproducibility of the AMP protocol and the three classes of AMP markers: digestion
sensitive (left-side arrows), digestion dependent (right-side arrows) and digestion resistant
(without arrows).

Figure 4 is a photographic representation showing tissue-specific AMP profiles observed
for human blood and spermatozoa. Profiles were amplified from HpaII-digested (H) and
undigested (U) genomic DNA of six humans (B1, B2, B3, S1, S2, and S3). Six different
individuals contributed the samples; B1, B2, and B3 were blood samples, whereas S1, S2
and S3 were spermatozoa samples. Tissue-specific digestion-dependent (upper panel) and
digestion-sensitive (lower panel) DNA methylation polymorphisms (see arrows) were
readily observed. DNA was extracted and analyzed in duplicate from each tissue sample.

Figure 5 is a diagrammatic representation showing map location of four human
digestion-dependent Class III AMP markers and the flanking HpaII sites. Each of these
digestion-dependent markers that was cloned and sequenced was 100% homologous to sequenced
regions of the human genome. The positions of the Class III amplicons (HdM 1-4) are
indicated by thick shaded boxes on the chromosomal segment. The locations of HpaII sites
are represented by vertical lines and the distance to, and orientation of nearby genes are
indicated. Cloning, sequencing and mapping of these and additional Class III AMP
markers has confirmed that they often map to CpG islands.

Figure 6 is a photographic representation showing distinct methylomes of tissues from 
testis, peritoneal fat pad and bone marrow of mice. 

Figure 7 is a photographic representation showing widespread DNA methylation
polymorphisms in mouse macrophage cell lines.

Figure 8 is a representation showing that AMP is a powerful diagnostic tool for identifying
stable cell lines, e.g. sub-cultures (1, 2 and 3 in replicate) of the human colon carcinoma
cell line HCT116 are identical for DNA methylation and DNA sequence.

Figure 9 is a representation showing that AMP detects DNA methylation in a p53 knock- 
out line compared to the parent cell line HCT116. Both cell lines are stable between sub- 
cultures (1, 2 and 3), but AMP detects widespread DNA methylation and DNA sequence 
changes between the two cell lines. 

Figure 10 is a photographic representation showing that Southern hybridization confirms 
tissue-specific AMPs between sperm (S) and blood (B). 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

The present invention provides an amplification-based assay to determine the methylome
profile in the genome of a eukaryotic cell or group of cells.

Determination of the methylation profile includes determining a methylation fingerprint or
signature of global methylation within a cell's or group of cells' genome or at specific sites
or regions. The terms "methylome" and "methylomes" refer to the generally multiple sites
of methylation within a cell's or group of cells' genome. These sites are also generally
distinctive. The methylome profile is proposed to change with developmental changes,
internal and external stimuli (e.g. environmental factors, RNAi, epigenetic signals) and
disease conditions (e.g. cancer). The methylome profile also directs changes in gene
expression in a cell. This can lead to different developmental, physiological or metabolic
states or conditions. It can also lead to the identification of one or more genes or other
genetic sequences such as promoter sequences involved or associated with a disease
condition or trait. This is important, for example, during proliferation, development and
self-renewal of precursor cells such as embryonic stem (ES) cells, embryonic germ (EG)
cells and post-natal (e.g. adult) or committed cells. It is also relevant in monitoring plant or
animal cells in tissue culture and/or following recombinant DNA procedures as well as
monitoring the extent of methylation of animal embryos such as in the livestock industry.
The embryos may be checked at pre- and post-transplantation stages.

Accordingly, one aspect of the present invention contemplates a method for determining
the methylation profile of one or more nucleotides at one or more sites within the genome
of a eukaryotic cell or group of cells, said method comprising obtaining a sample of
genomic DNA from said cell or group of cells, digesting a sub-sample of said sample of
genomic DNA with a methylation-sensitive enzyme having a recognition nucleotide
sequence comprising a nucleotide capable of methylation at one or more of the
above-mentioned sites, subjecting said digested DNA to an amplification reaction using primers
comprising a nucleotide sequence capable of annealing to a non-cleaved form of a
cleavable nucleotide sequence of the methylation-sensitive enzyme recognition sequence
and subjecting the products of the amplification reaction to separation or other detection
means relative to a control, said control comprising another sub-sample of said sample of
genomic DNA not subjected to digestion by the methylation-sensitive enzyme but subjected
to an amplification reaction using the same primers as for the digested DNA sample and
then subjecting the products of the amplification reaction to the separation and/or detection
means, wherein the presence of amplification products in enzyme
digested and non-digested samples is indicative of a methylation-sensitive
enzyme-digestion-resistant marker, the absence and presence of amplification products in enzyme
digested and undigested samples, respectively, is indicative of a methylation-sensitive
enzyme-digestion-sensitive marker and the presence and absence of amplification products
in enzyme digested and undigested samples, respectively, is indicative of a
methylation-sensitive enzyme-digestion-dependent marker.

The method of this aspect of the present invention involves obtaining a sample of DNA
and digesting one sub-sample with a methylation-sensitive enzyme and keeping another
sub-sample undigested. Both samples are then subjected to amplification with a primer
having a complementary sequence to a recognition sequence for the methylation-sensitive
enzyme. If methylation has occurred at a particular site, the complementary sequence of
the primer is intact as the enzyme is unable to digest the nucleotide sequence at that site.
An amplification product is, therefore, present in the digested and undigested samples. If
methylation has not occurred, digestion with the methylation-sensitive enzyme destroys the
primer-binding site. Consequently, the undigested sample will have an amplification
product but the digested sample will not. Finally, if the methylated site is flanked by or
proximal to an amplification-inhibiting region, the digestion will remove that region to
enable an amplification product but this will be absent in the undigested sample due to the
inhibition. This is regarded as a junction between methylated and unmethylated DNA.
Thus, the AMP protocol detects junctions between methylated and unmethylated DNA in
the genome.
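
The three outcomes described above can be restated as a small forward model, offered here only as a hedged sketch and not as part of the specification. The function name and its two boolean inputs are hypothetical. The model assumes complete digestion and assumes that digestion always releases a linked inhibitory region. The fourth combination (unmethylated site with a linked inhibitor) is not discussed in the text, and its predicted result is simply what these assumptions imply.

```python
from typing import Tuple


def predict_amplification(site_methylated: bool,
                          inhibitor_linked: bool = False) -> Tuple[bool, bool]:
    """Return (product_in_undigested, product_in_digested) for one amplicon."""
    # Undigested template: amplification proceeds unless a physically linked
    # inhibitory region suppresses it.
    in_undigested = not inhibitor_linked

    # Digested template: an unmethylated recognition site is cleaved, which
    # destroys the primer-binding site, so no product forms. A methylated
    # site is protected. Any linked inhibitor is assumed to be cut away.
    in_digested = site_methylated

    return in_undigested, in_digested
```

Under these assumptions, `predict_amplification(True)` corresponds to a digestion-resistant marker, `predict_amplification(False)` to a digestion-sensitive marker, and `predict_amplification(True, inhibitor_linked=True)` to a digestion-dependent marker.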

In a preferred aspect of the present invention, a global methylation signature is determined
using methylation-sensitive restriction endonuclease sites within islands of repeating
nucleotides. The most preferred islands are referred to as CpG or CpNpG islands where N
is any nucleotide. The most preferred methylation-sensitive restriction endonuclease sites
within or comprising CpG or CpNpG islands are HpaII sites. A HpaII site is CCGG.
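
As a minimal illustration (not taken from the specification), the CCGG recognition sequence and the CpG and CpNpG motifs can be located in a DNA string with Python's standard `re` module. The function names are hypothetical, only the given strand is scanned, and the input is assumed to contain the unambiguous bases A, C, G and T.

```python
import re

HPAII_SITE = "CCGG"  # recognition sequence stated in the text


def find_sites(sequence, motif=HPAII_SITE):
    """Return 0-based start positions of every occurrence of motif.

    A lookahead is used so that overlapping occurrences are also reported.
    """
    pattern = "(?=" + re.escape(motif) + ")"
    return [m.start() for m in re.finditer(pattern, sequence.upper())]


def count_cpg(sequence):
    """Count CpG dinucleotides (C immediately followed by G)."""
    return sequence.upper().count("CG")


def count_cpnpg(sequence):
    """Count CpNpG trinucleotides, where N is any of A, C, G or T."""
    return len(re.findall(r"(?=C[ACGT]G)", sequence.upper()))
```

Note that every CCGG site contains one CpG and two overlapping CpNpG motifs (CCG and CGG), so the counts are not independent of one another.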



Accordingly, another aspect of the present invention contemplates a method for
determining the methylation profile of one or more CpG or CpNpG nucleotides at one or
more sites within the genome of a eukaryotic cell, said method comprising obtaining a
sample of genomic DNA from said cell, digesting a sub-sample of said sample of genomic
DNA with HpaII which has a recognition nucleotide sequence corresponding to or within
said sites, subjecting said digested DNA to an amplification reaction using primers
comprising a nucleotide sequence capable of annealing to a non-cleaved form of a HpaII
cleavable nucleotide sequence and subjecting the products of the amplification reaction to
separation or other detection means relative to a control, said control comprising another
sub-sample of said sample of genomic DNA not subjected to digestion by HpaII but
subjected to an amplification reaction using the same primers as for the digested DNA
sample and then subjecting the products of the amplification reaction to the separation or
other detection means, wherein the presence of amplification products in enzyme digested and
non-digested samples is indicative of a HpaII-digestion-resistant marker (Hr), the absence
and presence of amplification products in enzyme digested and undigested samples,
respectively, is indicative of a HpaII-digestion-sensitive marker (Hs) and the presence and
absence of amplification products in enzyme digested and undigested samples,
respectively, is indicative of a HpaII-digestion-dependent marker (Hd).



Whilst HpaII, and its functional equivalents, are the most convenient methylation-sensitive
enzymes for use in the present invention up to the present time, one skilled in the art would
immediately recognize that other enzymes may be employed with recognition nucleotide
sequences within, comprising or adjacent to a CpG or CpNpG nucleotide sequence. A
recognition nucleotide sequence is the sequence of nucleotides which is essential for the
cleavage by a restriction endonuclease. A recognition nucleotide sequence is a sequence
which is cleaved by the enzyme. Other enzymes which may be used include AatII,
AciI, AclI, AgeI, AscI, AvaI, BamHI, BsaAI, BsaHI, BsiEI, BsiWI, BsrFI, BssHII, BstBI,
BstUI, ClaI, EagI, HaeII, HgaI, HhaI, HinP1I, MluI, MspI, NaeI, NarI, NotI, NruI and
PmlI.

The term "digestion" means that the DNA and restriction enzyme are brought together in
physical biochemical contact for a time and under conditions sufficient for the cleavable
nucleotide sequences to undergo cleavage. Digestion is generally for a time to permit
cleavage of substantially all cleavable sites specific for the particular enzyme.
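
To make the notion of complete digestion concrete, the following is a hedged in-silico sketch rather than a description of the laboratory procedure. It cuts a single-stranded sequence string at every unmethylated occurrence of the recognition motif and leaves methylated occurrences intact. The function name, the representation of methylation as a set of site start positions, and the cut offset of 1 (cleavage after the first C of CCGG, as commonly documented for HpaII) are assumptions for illustration.

```python
def digest(sequence, methylated_sites=frozenset(), motif="CCGG", cut_offset=1):
    """Return the fragments left after complete digestion of sequence.

    methylated_sites holds 0-based start positions of motif occurrences
    that are protected from cleavage. cut_offset is the number of bases
    from the start of the motif to the cut position (assumed value).
    """
    seq = sequence.upper()

    # Collect a cut position for every unprotected occurrence of the motif.
    cut_points = []
    start = seq.find(motif)
    while start != -1:
        if start not in methylated_sites:
            cut_points.append(start + cut_offset)
        start = seq.find(motif, start + 1)

    # Split the sequence at the cut positions, left to right.
    fragments = []
    previous = 0
    for cut in cut_points:
        fragments.append(seq[previous:cut])
        previous = cut
    fragments.append(seq[previous:])
    return fragments
```

With no methylated sites, a sequence containing two CCGG motifs yields three fragments; marking one site as methylated yields two, and marking both leaves the sequence uncut. Partial digestion, which the definition above seeks to avoid, is deliberately not modelled.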

The term "genomic DNA" includes all DNA in a cell, group of cells, or in an organelle of
a cell and includes exogenous DNA such as transgenes introduced into a cell.

Any amplification methodology may be employed in accordance with the present
invention. Amplification methodologies contemplated herein include the polymerase chain
reaction (PCR) such as disclosed in U.S. Patent Nos. 4,683,202 and 4,683,195; the ligase
chain reaction (LCR) such as disclosed in European Patent Application No. EP-A-320 308
and gap filling LCR (GLCR) or variations thereof such as disclosed in International Patent
Publication No. WO 90/01069, European Patent Application EP-A-439 182, British Patent
No. GB 2,225,112A and International Patent Publication No. WO 93/00447. Other
amplification techniques include Qβ replicase such as described in the literature; Strand
Displacement Amplification (SDA) such as described in European Patent Application Nos.
EP-A-497 272 and EP-A-500 224; Self-Sustained Sequence Replication (3SR) such as
described in Fahy et al. (PCR Methods Appl. 1(1): 25-33, 1991) and Nucleic Acid
Sequence-Based Amplification (NASBA) such as described in the literature.

Some amplification reactions, for example, PCR and LCR, involve cycles of alternately
high and low set temperatures, a process known as "thermal cycling". PCR or "polymerase
chain reaction" is an amplification reaction in which a polymerase enzyme, usually
thermostable, generates multiple copies of the original sequence by extension of a primer
using the original nucleic acid as a template. PCR is described in more detail in U.S. Patent
Nos. 4,683,202 and 4,683,195. LCR or "ligase chain reaction" is a nucleic acid
amplification reaction in which a ligase enzyme, usually thermostable, generates multiple
copies of the original sequence by ligating two or more oligonucleotide probes while they
are hybridized to the target. LCR and its variation, Gap LCR, are described in more detail
in European Patent Application Nos. EP-A-320 308 and EP-A-439 182 and International
Patent Publication No. WO 90/100447 and elsewhere.

The PCR amplification process is the most preferred in practising the present invention. 

Accordingly, in a preferred aspect of the present invention, there is provided a method for
determining the methylation profile of one or more CpG or CpNpG nucleotides at one or
more sites within the genome of a eukaryotic cell or group of cells, said method
comprising obtaining a sample of genomic DNA from said cell or group of cells, digesting
a sub-sample of said sample of genomic DNA with HpaII which has a recognition
nucleotide sequence corresponding to or within said sites, subjecting said digested DNA to
a polymerase chain reaction (PCR) using primers comprising a nucleotide sequence
capable of annealing to a non-cleaved form of a HpaII cleavable nucleotide sequence and
subjecting the products of the PCR to separation or other detection means relative to a
control but subjected to an amplification reaction using the same primers as for the
digested DNA sample and then subjecting the products of the amplification reaction to the
separation or other detection means, said control comprising another sub-sample of said
sample of genomic DNA not subjected to digestion by HpaII, wherein the presence of PCR
products in enzyme digested and non-digested samples is indicative of a
HpaII-digestion-resistant marker (H^r), the absence and presence of PCR products in
enzyme digested and undigested samples, respectively, is indicative of a
HpaII-digestion-sensitive marker (H^s) and the presence and absence of PCR products in
enzyme digested and undigested samples, respectively, is indicative of a
HpaII-digestion-dependent marker (H^d).
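
The three marker classes defined above follow directly from the presence or absence of a product in the digested and undigested sub-samples. The following Python sketch is one way to express that decision rule. The function names are hypothetical, and the representation of a product by its band size is an assumption made purely for illustration.

```python
def classify_amp_marker(present_in_digested, present_in_undigested):
    """Classify one amplification product by the rule stated in the text.

    Returns "resistant", "sensitive", "dependent", or None when the product
    is absent from both sub-samples (no marker can be called).
    """
    if present_in_digested and present_in_undigested:
        return "resistant"
    if present_in_undigested:
        return "sensitive"   # absent after digestion, present without it
    if present_in_digested:
        return "dependent"   # present after digestion, absent without it
    return None


def methylation_profile(digested_bands, undigested_bands):
    """Build a profile from two collections of band identifiers.

    Assumption: each product is identified by a hashable, sortable label
    such as its size in base pairs.
    """
    digested = set(digested_bands)
    undigested = set(undigested_bands)
    return {
        band: classify_amp_marker(band in digested, band in undigested)
        for band in sorted(digested | undigested)
    }


if __name__ == "__main__":
    # Hypothetical band sizes (bp) read from a gel.
    print(methylation_profile({300, 450}, {300, 700}))
```

Keying the profile by band label keeps the comparison between sub-samples explicit, which mirrors the side-by-side reading of digested and undigested lanes.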

The term "separation means" extends to any profiling protocol aimed at distinguishing 
between different amplification products. The term "separation means" may also be read as
"separation or other detection means". Gel electrophoresis is the most convenient as this
permits comparison of individual bands following separation. The term "gel" includes
agarose and polyacrylamide amongst others. A range of other separation means may also
be applied such as those involving spectrophotometric, fluorometric and mass 
spectrographic methods. These latter methods are particularly useful for detecting a 
specific amplicon or amplicons. 

In one particular embodiment, the presence or absence of AMP markers is assayed in
real-time using a real-time PCR. As indicated above, PCR is well known to those in the art. A
real-time PCR reaction includes a PCR reaction in which the products are detected during
the PCR reaction. Preferably, the detection of the PCR product or products is made during
the linear phase of a PCR reaction. It is known to those skilled in the art that, during the
linear phase of a PCR, the quantity of a product or amplicon formed is proportional to the
quantity of target sequence present at the initiation of the PCR.
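
If product quantity is proportional to starting target quantity, as stated above, then the ratio of the signals from the digested and undigested sub-samples gives a rough estimate of the fraction of target copies that survived digestion. The Python sketch below illustrates this reading only. The function name and the idea of comparing two signals taken at the same cycle are assumptions and are not part of the described method.

```python
def surviving_fraction(signal_digested, signal_undigested):
    """Estimate the fraction of target copies left intact by digestion.

    Both arguments are hypothetical product signals (for example,
    fluorescence units) read at the same cycle from the digested and the
    undigested sub-sample. The estimate relies on the proportionality
    between product and starting target described in the text.
    """
    if signal_digested < 0:
        raise ValueError("signal_digested must not be negative")
    if signal_undigested <= 0:
        raise ValueError("signal_undigested must be positive")
    return signal_digested / signal_undigested


if __name__ == "__main__":
    print(surviving_fraction(30.0, 120.0))
```

A digestion-dependent marker has no product in the undigested sub-sample, so the ratio is undefined and the function raises an error rather than returning a value.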

In a preferred real-time PCR a labeled probe molecule capable of hybridizing to the target
sequence (e.g. an AMP marker) is used in the PCR. In subsequent amplification rounds,
the probe is displaced from the amplicon by the 5' exonuclease activity of Taq polymerase.
Generally, but not exclusively, the probes are labeled at the 5' nucleotide and the
exonuclease activity releases the 5' probe molecule. Release of the labeled nucleotide from
the probe results in the ability to detect the presence of a specific PCR product. Real-time
PCR assays are well known to those skilled in the art and include those disclosed in U.S.
Patent No. 6,277,638.

The term "label" means a moiety which is conjugated to a probe or an oligonucleotide
moiety. A labeled probe or oligonucleotide moiety can be detected, identified or captured
by virtue of the label or moiety which is conjugated to the probe. A "directly detectable"
label is a signal-producing label which is capable of detection either directly or through its
interaction with a substance such as a substrate (in the case of an enzyme), a light source
(in the case of a fluorescent compound) or a photomultiplier tube (in the case of a
radioactive or chemiluminescent compound). A "proximity label" is one of at least two
labels which interact with each other to produce a detectable signal when the proximity
labels are brought together. Typically, a first proximity label is used in combination with a
corresponding second proximity label. An "indirectly detectable" label is a substance
which in and of itself does not provide a signal but which can be utilized to identify an
oligonucleotide to which the indirectly detectable label is attached. For example, biotin can
be an indirectly detectable label.

DNA molecules are said to have a "5'-terminus" (5' end) and a "3'-terminus" (3' end)
because nucleic acid phosphodiester linkages occur to the 5' carbon and 3' carbon of the
pentose ring of the substituent mononucleotides. A terminal nucleotide as used herein is
the nucleotide at the end position of the 3'- or 5'-terminus. As used herein, a nucleotide
sequence, even if internal to a larger oligonucleotide or polynucleotide, can also be said to
have 5'- and 3'-ends.

Reference herein to a "multiplex PCR" refers to a real-time PCR that contains more than
one probe. In this embodiment, AMP marker separation is achieved by monitoring either
the accumulation of a specific labeled nucleotide or the depletion of a
specifically labeled nucleotide from the PCR.

Accordingly, another aspect of the present invention contemplates a method for
determining the methylation profile of one or more nucleotides at one or more sites within
the genome of a eukaryotic cell or group of cells, said method comprising obtaining a
sample of genomic DNA from said cell or group of cells, digesting a sub-sample of said
sample of genomic DNA with a methylation-sensitive enzyme having a recognition
nucleotide sequence comprising a nucleotide capable of methylation at one or more of the
above-mentioned sites, subjecting said digested DNA to an amplification reaction using
primers comprising a nucleotide sequence capable of annealing to a non-cleaved form of a
cleavable nucleotide sequence of the methylation-sensitive enzyme recognition sequence
and subjecting the products of the amplification reaction to real-time PCR to distinguish
amplification products relative to a control but subjected to an amplification reaction using
the same primers as for the digested DNA sample and then subjecting the products of the
amplification reaction to the separation or other detection means, said control comprising
another sub-sample of said sample of genomic DNA not subjected to digestion by the
methylation-sensitive enzyme, wherein the presence of amplification products in enzyme
digested and non-digested samples is indicative of a methylation-sensitive
enzyme-digestion-resistant marker, the absence and presence of amplification products in
enzyme digested and undigested samples, respectively, is indicative of a methylation-sensitive
enzyme-digestion-sensitive marker and the presence and absence of amplification products
in enzyme digested and undigested samples, respectively, is indicative of a
methylation-sensitive enzyme-digestion-dependent marker.

More particularly, the present invention provides a method for determining the methylation
profile of one or more CpG or CpNpG nucleotides at one or more sites within the genome
of a eukaryotic cell or group of cells, said method comprising obtaining a sample of
genomic DNA from said cell, digesting a sub-sample of said sample of genomic DNA with
HpaII which has a recognition nucleotide sequence corresponding to or within said sites,
subjecting said digested DNA to an amplification reaction using primers comprising a
nucleotide sequence capable of annealing to a non-cleaved form of a HpaII cleavable
nucleotide sequence and subjecting the products of the amplification reaction to real-time
PCR to distinguish amplification products relative to a control but subjected to an
amplification reaction using the same primers as for the digested DNA sample and then
subjecting the products of the amplification reaction to the separation or other detection
means, said control comprising another sub-sample of said sample of genomic DNA not
subjected to digestion by HpaII, wherein the presence of amplification products in enzyme
digested and non-digested samples is indicative of a HpaII-digestion-resistant marker (H^r),
the absence and presence of amplification products in enzyme digested and undigested
samples, respectively, is indicative of a HpaII-digestion-sensitive marker (H^s) and the
presence and absence of amplification products in enzyme digested and undigested samples,
respectively, is indicative of a HpaII-digestion-dependent marker (H^d).

Reference herein to a "probe molecule" refers to a labeled nucleotide molecule which is
capable of hybridizing to a target sequence. A target sequence is generally a DNA
amplicon produced by a first amplification reaction following restriction digestion by a
methylation-sensitive enzyme. Methods for the production of nucleic acid probes suitable
for use in PCR are well known in the art. Methods for the determination of a suitable size
and sequence of an oligonucleotide are also well known in the art.

In one particularly useful embodiment, the probe molecule is fluorescently labeled at its 5'
end.

Fluorescent nucleotide analogs are widely known and commercially available from several
sources. An exemplary source is NEN (trademark) Life Science Products (Boston,
Massachusetts, USA) which offers dideoxy-, deoxy- and ribonucleotide analogs labeled
with fluorescein, coumarin, tetramethylrhodamine, naphthofluorescein, pyrene, Texas Red
(registered trademark) and Lissamine (trademark). Other suppliers include Amersham
Pharmacia Biotech (Uppsala, Sweden; Piscataway, NJ, USA) and MBI Fermentas, Inc.
(Amherst, NY, USA).

An advantage of using fluorescent labels and fluorescence spectroscopic analysis is that
there are multiple different labels available. Such different labels can be particularly useful
in a multiplex embodiment of the present invention. Different fluorescent labels may be
used in different probes so that the detection of a particular fluorescently-labeled
nucleotide can be distinguished from others and used to deduce which AMP markers are
present.

For example, fluorescein has a 488 nm excitation and 520 nm emission wavelength,
whereas rhodamine (in the form of tetramethylrhodamine) has 550 nm excitation and 575
nm emission wavelength. A fluorescence detector provides an excitation source and an
emission detector. The emission wavelengths of 520 nm and 575 nm are easily
distinguishable using fluorescence spectroscopy.
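
The way distinct emission wavelengths allow labels to be told apart in a multiplex reaction can be sketched in Python as follows. The wavelengths are those quoted above. The function name, the 10 nm tolerance and the nearest-maximum rule are hypothetical choices made for illustration and do not describe any particular instrument.

```python
# Excitation and emission wavelengths (nm) as quoted in the text.
FLUOROPHORES = {
    "fluorescein": {"excitation": 488, "emission": 520},
    "tetramethylrhodamine": {"excitation": 550, "emission": 575},
}


def identify_label(observed_emission_nm, tolerance_nm=10):
    """Return the label whose emission wavelength is nearest the observation.

    Returns None when no label lies within `tolerance_nm` (an assumed,
    illustrative tolerance) of the observed wavelength.
    """
    nearest = min(
        FLUOROPHORES,
        key=lambda name: abs(FLUOROPHORES[name]["emission"] - observed_emission_nm),
    )
    if abs(FLUOROPHORES[nearest]["emission"] - observed_emission_nm) <= tolerance_nm:
        return nearest
    return None


if __name__ == "__main__":
    for wavelength in (522, 574, 650):  # hypothetical readings
        print(wavelength, identify_label(wavelength))
```

Because the two emission wavelengths are 55 nm apart, a 10 nm tolerance cannot assign one reading to both labels.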

On a per molecule basis, fluorescence spectroscopy is about 10-fold more sensitive than
absorbance spectroscopy. A very wide variety of fluorescence spectroscopy-based
detectors are commercially available for reading fluorescence values of single tubes, flow
cells and multi-well plates, amongst others. For example, Labsystems Multiskan models of
microplate readers are widely available with a spectral range of 400-750 nm and filters for
340, 405, 414, 450, 492, 540, 620 and 690 nm (e.g. Fisher Scientific, Pittsburgh, PA,
USA).

In a related embodiment, the present invention contemplates the use of fluorescence
resonance energy transfer (FRET) for the detection of a PCR amplicon. FRET is a form of
molecular energy transfer by which energy is passed between a donor molecule and an
acceptor molecule. FRET arises from the properties of certain chemical compounds. When
excited by exposure to particular wavelengths of light, they emit light (i.e. they fluoresce)
at a different wavelength. Such compounds are termed "fluorophores". In FRET, energy is
passed over a long distance between a donor molecule which is a fluorophore and an
acceptor molecule. The donor absorbs a photon and transfers energy to the acceptor.

Molecules that are commonly used in FRET include fluorescein, 5-carboxyfluorescein
(FAM), 2',7'-dimethoxy-4',5'-dichloro-6-carboxyfluorescein (JOE), rhodamine,
6-carboxyrhodamine (R6G), N,N,N',N'-tetramethyl-6-carboxyrhodamine (TAMRA),
6-carboxy-X-rhodamine (ROX), 4-(4'-dimethylaminophenylazo) benzoic acid (DABCYL) and
5-(2'-aminoethyl)aminonaphthalene-1-sulfonic acid (EDANS).

In another aspect of the present invention, absorbance spectrographic analysis is
contemplated to provide an analysis of AMP markers generated in a PCR reaction thereby
providing for the determination of the methylation status of a gene. In this aspect of the
present invention, a chromatographic separation with light absorbance detection of an
AMP marker amplicon in a PCR mixture can be used to measure the quantity of PCR
product. In this case, the second PCR employs a specific probe/primer to identify a
particular amplicon. In an illustrative embodiment, a multiplexed assay for the presence of
several different nucleic acid target sequences in a sample is analyzed by absorbance
spectroscopy. Several labeled probes to various nucleic acid target sequences are added to
a nucleic acid sample. The labels on the probes may be various nucleotide analogs, a
different one for each probe. A PCR reaction is conducted and the quantity of a PCR
product is measured by absorbance spectroscopy. In this aspect of the present invention,
the PCR is directly loaded onto a pre-equilibrated high pressure liquid chromatography
(HPLC) column and eluted under conditions that separate the nucleotide analogs from the
natural nucleotides. Useful media for chromatographic separation of nucleotides, bases and
nucleosides include reverse phase media (such as a reverse phase C18 column or
ODS-80TM or ODS-120T TSK-GEL by TosoHaas [Montgomeryville, PA, USA]), anion
exchange media (such as DEAE-25SW or SP-25W TSK-GEL by TosoHaas
[Montgomeryville, PA, USA]) or affinity media (such as Boronate-5PW TSK-GEL by
TosoHaas [Montgomeryville, PA, USA]).

The HPLC column is fitted with an absorbance detector to monitor the column effluent, 
hence, "absorbance spectroscopy" for this type of analysis. Typical wavelengths for 
monitoring HPLC detection of nucleotides are 250 nm, 260 nm and 280 nm. Thus, the 
quantity of a PCR product may be determined in real-time by periodic sampling of the 
PCR. 

Identification of the AMP marker PCR products can be accomplished by comparison of the 
retention times (as monitored by absorbance of effluent at various times) of standards of 
the nucleotide analogs separated on the same HPLC column under the same conditions. 
Alternatively, the identity of the nucleotide analogs collected in separate fractions after 
chromatographic separation (as determined by monitoring the absorbance of the column 
effluent) can be determined by other standard analytical methods such as gel 
electrophoresis. In an alternative aspect, the PCR mixture is separated on a gas 
chromatograph fitted with an absorbance detector to monitor column effluent. 

In another embodiment, the AMP marker or multiplex of AMP markers generated in a
single PCR reaction are hybridized to a microarray of potentially complementary
nucleotide sequences such as DNA sequences. As a single oligonucleotide of the present
invention (comprising, for example, a HpaII site) may generate a plurality of AMP
markers belonging to different AMP classes, a rapid method is required to determine the
nucleotide sequence of the AMP markers. The nucleotide sequence identity of a particular
AMP marker can be determined by reference, for example, to a known genomic DNA
sequence. Suitable arrays of DNA for the analysis of AMP markers include microarrays of
human genes and/or contigs derived from genomic libraries, or particularly, arrays of CpG
islands in genomes. Hybridization of a PCR product comprising one or a plurality of AMP
markers facilitates the isolation and identification of genetic sequences.

A "nucleic acid" as used herein, is a covalently linked sequence of nucleotides in which
the 3' position of the phosphorylated pentose of one nucleotide is joined by a
phosphodiester group to the 5' position of the pentose of the next nucleotide and in which
the nucleotide residues are linked in specific sequence; i.e. a linear order of nucleotides. A
"polynucleotide" as used herein, is a nucleic acid containing a sequence that is greater than
about 100 nucleotides in length. An "oligonucleotide" as used herein, is a short
polynucleotide or a portion of a polynucleotide. An oligonucleotide typically contains a
sequence of about two to about one hundred bases. The word "oligo" is sometimes used in
place of the word "oligonucleotide". The term "oligo" also includes a particularly preferred
primer length in the practice of the present invention of up to about 10 nucleotides.

As used herein, the term "primer" refers to an oligonucleotide or polynucleotide that is
capable of hybridizing to another nucleic acid of interest under particular stringency
conditions. A primer may occur naturally as in a purified restriction digest or be produced
synthetically, by recombinant means or by PCR amplification. The terms "probe" and
"primer" may be used interchangeably, although to the extent that an oligonucleotide is
used in a PCR or other amplification reaction, the term is generally "primer". The ability to
hybridize is dependent in part on the degree of complementarity between the nucleotide
sequence of the primer and complementary sequence on the target DNA.

The terms "complementary" or "complementarity" are used in reference to nucleic acids
(i.e. a sequence of nucleotides) related by the well-known base-pairing rules that A pairs
with T or U and C pairs with G. For example, the sequence 5'-A-G-T-3' is complementary
to the sequence 3'-T-C-A-5' in DNA and 3'-U-C-A-5' in RNA. Complementarity can be
"partial" in which only some of the nucleotide bases are matched according to the base
pairing rules. On the other hand, there may be "complete" or "total" complementarity
between the nucleic acid strands when all of the bases are matched according to
base-pairing rules. The degree of complementarity between nucleic acid strands has significant
effects on the efficiency and strength of hybridization between nucleic acid strands, as is
well known in the art. This is of particular importance in detection methods that depend
upon binding between nucleic acids, such as those of the invention. The term "substantially
complementary" is used to describe any primer that can hybridize to either or both strands
of the target nucleic acid sequence under conditions of low stringency as described below
or, preferably, in polymerase reaction buffer (Promega, M195A) heated to 95°C and then
cooled to room temperature. As used herein, when the primer is referred to as partially or
totally complementary to the target nucleic acid, that refers to the 3'-terminal region of the
probe (i.e. within about 10 nucleotides of the 3'-terminal nucleotide position).
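
The base-pairing rules and the distinction between partial and total complementarity can be illustrated with a short Python sketch. The function names are hypothetical. The sketch compares two strands of equal length position by position and does not model hybridization conditions or the 3'-terminal region specifically.

```python
# Watson-Crick pairs for DNA, as stated in the text (U replaces T in RNA).
DNA_PAIRS = {"A": "T", "T": "A", "C": "G", "G": "C"}


def complement_3to5(seq_5to3, rna=False):
    """Return the complementary strand written 3'->5', aligned base by base."""
    comp = "".join(DNA_PAIRS[base] for base in seq_5to3.upper())
    return comp.replace("T", "U") if rna else comp


def complementarity(primer_5to3, target_3to5):
    """Return the fraction of aligned positions obeying the pairing rules.

    A value of 1.0 corresponds to total complementarity; any value between
    0 and 1 corresponds to partial complementarity.
    """
    if not primer_5to3 or len(primer_5to3) != len(target_3to5):
        raise ValueError("strands must be non-empty and of equal length")
    target = target_3to5.upper().replace("U", "T")
    matches = sum(
        DNA_PAIRS[p] == t for p, t in zip(primer_5to3.upper(), target)
    )
    return matches / len(primer_5to3)


if __name__ == "__main__":
    print(complement_3to5("AGT"))            # DNA complement, written 3'->5'
    print(complement_3to5("AGT", rna=True))  # RNA complement, written 3'->5'
    print(complementarity("AGT", "TCG"))     # one mismatch out of three
```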

Reference herein to a stringency in relation to hybridization includes and encompasses
from at least about 0 to at least about 15% v/v formamide and from at least about 1 M to at
least about 2 M salt for hybridization, and at least about 1 M to at least about 2 M salt for
washing conditions. Generally, low stringency is at from about 25-30°C to about 42°C.
The temperature may be altered and higher temperatures used to replace formamide and/or
to give alternative stringency conditions. Alternative stringency conditions may be applied
where necessary, such as medium stringency, which includes and encompasses from at
least about 16% v/v to at least about 30% v/v formamide and from at least about 0.5 M to
at least about 0.9 M salt for hybridization, and at least about 0.5 M to at least about 0.9 M
salt for washing conditions, or high stringency, which includes and encompasses from at
least about 31% v/v to at least about 50% v/v formamide and from at least about 0.01 M to
at least about 0.15 M salt for hybridization, and at least about 0.01 M to at least about 0.15
M salt for washing conditions. In general, washing is carried out at T_m = 69.3 + 0.41 (G+C)%
(Marmur and Doty, J. Mol. Biol. 5: 109, 1962). However, the T_m of a duplex DNA
decreases by 1°C with every increase of 1% in the number of mismatch base pairs (Bonner
and Laskey, Eur. J. Biochem. 46: 83, 1974). Formamide is optional in these hybridization
conditions. Accordingly, particularly preferred levels of stringency are defined as follows:
low stringency is 6 x SSC buffer, 0.1% w/v SDS at 25-42°C; a moderate stringency is 2 x
SSC buffer, 0.1% w/v SDS at a temperature in the range 20°C to 65°C; high stringency is
0.1 x SSC buffer, 0.1% w/v SDS at a temperature of at least 65°C.
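
The T_m relationship quoted above, together with the stated 1°C decrease per 1% of mismatched base pairs, can be worked through numerically. The Python sketch below applies the formula exactly as given in the text. The function names and the example values are hypothetical, and the sketch makes no correction for salt, formamide or duplex length.

```python
def gc_percent(sequence):
    """Return the percentage of G and C bases in a non-empty sequence."""
    seq = sequence.upper()
    if not seq:
        raise ValueError("sequence must not be empty")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def melting_temperature(gc_pct, mismatch_pct=0.0):
    """Apply T_m = 69.3 + 0.41 * (G+C)% minus 1 degree C per 1% mismatch.

    Both arguments are percentages in the range 0 to 100.
    """
    if not 0.0 <= gc_pct <= 100.0:
        raise ValueError("gc_pct must be between 0 and 100")
    if not 0.0 <= mismatch_pct <= 100.0:
        raise ValueError("mismatch_pct must be between 0 and 100")
    return 69.3 + 0.41 * gc_pct - 1.0 * mismatch_pct


if __name__ == "__main__":
    # Hypothetical duplex with 50% G+C content.
    print(round(melting_temperature(50.0), 1))        # perfectly matched
    print(round(melting_temperature(50.0, 5.0), 1))   # 5% mismatched base pairs
```

For a duplex with 50% G+C the formula gives 69.3 + 20.5 = 89.8°C, and 5% mismatch lowers this to 84.8°C.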

The present invention provides, therefore, a methylation profile of the genome of a
eukaryotic cell or group of cells, the methylation profile comprising the presence or
absence of methylation at particular sites and/or junctions between methylated and
unmethylated regions, said method comprising obtaining a sample of genomic DNA from
said cell or group of cells, digesting a sub-sample of said sample of genomic DNA with a
methylation-sensitive enzyme having a recognition nucleotide sequence comprising a
nucleotide capable of methylation at one or more of the above-mentioned sites, subjecting
said digested DNA to an amplification reaction using primers comprising a nucleotide
sequence capable of annealing to a non-cleaved form of a cleavable nucleotide sequence of
the methylation-sensitive enzyme recognition sequence and subjecting the products of the
amplification reaction to separation or other detection means relative to a control but
subjected to an amplification reaction using the same primers as for the digested DNA
sample and then subjecting the products of the amplification reaction to the separation or
other detection means, said control comprising another sub-sample of said sample of
genomic DNA not subjected to digestion by the methylation-sensitive enzyme, wherein the
presence of amplification products in enzyme digested and non-digested samples is
indicative of a methylation-sensitive enzyme-digestion-resistant marker, the absence and
presence of amplification products in enzyme digested and undigested samples,
respectively, is indicative of a methylation-sensitive enzyme-digestion-sensitive marker
and the presence and absence of amplification products in enzyme digested and undigested
samples, respectively, is indicative of a methylation-sensitive enzyme-digestion-dependent
marker.

The present invention also contemplates kits for determining the methylation profile of one
or more nucleotides at one or more sites within the genome of a eukaryotic cell or group of
cells. The kits may comprise many different forms but in a preferred embodiment, the kits
are designed to create products capable of resolution via electrophoresis.



The kit may also comprise instructions for use. 

Accordingly, another aspect of the present invention contemplates an assay device in the
form of a kit useful for determining the methylation profile of one or more nucleotides at
one or more sites within the genome of a eukaryotic cell or group of cells, said kit
comprising in compartmental form multiple compartments each adapted to comprise one
or more of buffers, diluents and enzymes as single or multiple components which are
optionally required to be admixed prior to use, said kit further comprising instructions for
use, wherein the method is conducted by obtaining a sample of genomic DNA from said
cell or group of cells, digesting a sub-sample of said sample of genomic DNA with a
methylation-sensitive enzyme having a recognition nucleotide sequence comprising a
nucleotide capable of being methylated at one or more of the above-mentioned sites,
subjecting said digested DNA to an amplification reaction using primers comprising a
nucleotide sequence capable of annealing to a non-cleaved form of a cleavable nucleotide
sequence of the methylation-sensitive enzyme recognition sequence and subjecting the
products of the amplification reaction to separation or other detection means relative to a
control but subjected to an amplification reaction using the same primers as for the
digested DNA sample and then subjecting the products of the amplification reaction to the
separation or other detection means, said control comprising another sub-sample of said
sample of genomic DNA not subjected to digestion by the methylation-sensitive enzyme,
wherein the presence of amplification products in enzyme digested and non-digested
samples is indicative of a methylation-sensitive enzyme-digestion-resistant marker, and
wherein the absence and presence of amplification products in enzyme digested and
undigested samples, respectively, is indicative of a methylation-sensitive
enzyme-digestion-sensitive marker and wherein the presence and absence of amplification products
in enzyme digested and undigested samples, respectively, is indicative of a
methylation-sensitive enzyme-digestion-dependent marker.

More particularly, another aspect of the present invention contemplates an assay device in
the form of a kit useful for determining the methylation profile of one or more nucleotides
at one or more sites within the genome of a eukaryotic cell or group of cells, said kit
comprising in compartmental form multiple compartments each adapted to comprise one
or more of buffers, diluents and enzymes as single or multiple components which are
optionally required to be admixed prior to use, said kit further comprising instructions for
use, wherein the method is conducted by obtaining a sample of genomic DNA from said
cell or group of cells, digesting a sub-sample of said sample of genomic DNA with HpaII
which has a recognition nucleotide sequence corresponding to or within said sites,
subjecting said digested DNA to an amplification reaction using primers comprising a
nucleotide sequence capable of annealing to a non-cleaved form of a HpaII cleavable
nucleotide sequence and subjecting the products of the amplification reaction to separation
or other detection means relative to a control but subjected to an amplification reaction
using the same primers as for the digested DNA sample and then subjecting the products of
the amplification reaction to the separation or other detection means, said control
comprising another sub-sample of said sample of genomic DNA not subjected to digestion
by HpaII, wherein the presence of amplification products in enzyme digested and
non-digested samples is indicative of a HpaII-digestion-resistant marker (H^r), the absence and
presence of amplification products in enzyme digested and undigested samples,
respectively, is indicative of a HpaII-digestion-sensitive marker (H^s) and the presence and
absence of amplification products in enzyme digested and undigested samples,
respectively, is indicative of a HpaII-digestion-dependent marker (H^d).

Conveniently, the kits are adapted to contain compartments for two or more of the above- 
listed components. Furthermore, buffers, nucleotides and/or enzymes may be combined 
into a single compartment. 

The Taq polymerase is an example of a suitable DNA polymerase which is thermostable
and which is used in PCR and other amplification reactions. The thermostable DNA
polymerase is used in an amount sufficient for a hybridized probe to release an identifier
nucleotide. This amount may vary with the enzyme used and also with the temperature at
which depolymerization is carried out. An enzyme of a kit is typically present in an
amount sufficient to permit the use of about 0.1 to 100 U/reaction; in particularly preferred
embodiments, the concentration is about 0.5 U/reaction. The Stoffel fragment is another
example of a suitable DNA polymerase and is in fact preferred over Taq for AMPs.

As stated above, instructions optionally present in such kits instruct the user on how to use
the components of the kit to perform the various methods of the present invention. It is
contemplated that these instructions include a description of the detection methods of the
subject invention, including detection by gel electrophoresis.

The present invention further contemplates kits which contain a primer for a nucleic acid
target of interest with the primer being complementary to a predetermined nucleic acid
target, such as comprising the complementary sequence or sense sequence of a HpaII
recognition sequence. In another embodiment, the kit contains multiple primers or probes,
each of which contains a different base at an interrogation position or which is designed to
interrogate different target DNA sequences. In a contemplated embodiment, multiple
probes are provided for a set of nucleic acid target sequences that give rise to analytical
results which are distinguishable for the various probes.

It is contemplated that a kit comprises a vessel containing a purified and isolated enzyme
whose activity is to release one or more nucleotides from the 3' terminus of a hybridized
nucleic acid probe and a vessel containing pyrophosphate. In one embodiment, these items
are combined in a single vessel. It is contemplated that the enzyme is either in solution
or provided as a solid (e.g. as a lyophilized powder); the same is true for the
pyrophosphate. Preferably, the enzyme is provided in solution. Some contemplated kits
contain labeled nucleic acid probes. Other contemplated kits further comprise vessels
containing labels and vessels containing reagents for attaching the labels. Microtiter
trays are particularly useful and these may comprise from two to 100,000 wells or from
about six to about 10,000 wells or from about six to about 1,000 wells.

The methylation profile of a cell's or group of cells' genome may be in any location
including promoter regions, 5' regions prior to the authentic transcription start site,
within coding (i.e. exon) and non-coding (i.e. intron) regions of genes and in the 3'
regions following the authentic termination signal. The methylome may also be in
regulatory genes, structural genes and in non-protein-encoding genes.

The present invention has a range of utilities. For example, a number of cancers have been
detected in which previously silenced genes (such as silenced at birth) are now expressed.
A change in methylation profile, especially if mapped to cancer-associated genes, provides
a diagnostic application for the AMP technology. Indeed, many complex medical disorders
are likely to have a methylation component. For example, during hypertrophy in heart
disease, there is a reappearance of expression of juvenile-expressed genes. This is
presumably due to changes in methylation profile. In addition, methylation profiles may be
implicated in the aging process as well as in cell development such as the development of
stem cells (e.g. ES cells, post-natal stem cells) or EG cells to mature cells or cells of a
developmental state in between ES cells and mature cells. Still further, the extent or type
of methylation may be predictive of the sensitivity or otherwise of a cell or group of
cells to viruses or to transposon mutagenesis or transgene expression.

In a related embodiment, changes in methylation profile may be associated with RNAi. It is
proposed that RNAi is induced by double-stranded RNA whether naturally occurring or
artificially produced via co-suppression constructs. A change in methylome profile may be a
result of a change in RNAi-specific modulation of gene expression. The detection of a
change in methylome profile may provide a target for gene therapy. Furthermore, a change in
methylation profile may be directly or indirectly induced.

As indicated above, methylation may also be a key element in the aging process.
Modification of methylation profiles may delay the onset of or ameliorate the effects of
age-related conditions and illnesses. Support for this proposition is in the change in
methylation profiles in identical twins. The detection of age-related methylation profiles
or polymorphisms is useful, for example, in forensic medicine and testing of age of animals
in the livestock industry.

The methylation profiling assay of the present invention is particularly useful, therefore,
in monitoring aging of cells and may be used in conjunction with palliative care or in
clinical management of individuals having the methylation profile of a cell or group of
cells at a particular stage of development. This is important not only in human and
veterinary health but also in monitoring plants and trees, especially those generated
through tissue culture.

Another important application is in the high throughput screening of agents which are
capable of demethylating genomes. This may be important, for example, in
de-differentiating cells.

In relation to these embodiments, genetic constructs such as used, for example, in gene 
therapy, can be engineered to attach cell-specific methylation modulation signals to ensure 
that genes are regulated in the appropriate manner. 

In the livestock industry, such as meat production, wool production and breeding 
programs, amongst many other facets, monitoring methylome profiles in relation to 
environmental conditions (e.g. drought, weather, change in diet, etc.) or following 
vaccination programs will be important for ensuring there is no loss of quality or decrease 
in production levels. This is also important for general toxicity testing of humans and 
animals. Many disease conditions are mediated by toxins and environmental contaminants 
and AMP technology provides a means for monitoring for changes in methylome profile. 

Accordingly, another aspect of the present invention contemplates a method for detecting a
change in a cell's or group of cells' developmental state or a cell's or group of cells'
exposure to an internal or external stimulus, said method comprising detecting a change in
methylation profile over time by the method comprising obtaining a sample of genomic
DNA from said cell or group of cells, digesting a sub-sample of said sample of genomic
DNA with a methylation-sensitive enzyme having a recognition nucleotide sequence
comprising a nucleotide capable of being methylated at one or more of the above-mentioned
sites, subjecting said digested DNA to an amplification reaction using primers
comprising a nucleotide sequence capable of annealing to a non-cleaved form of a
cleavable nucleotide sequence of the methylation-sensitive enzyme recognition sequence
and subjecting the products of the amplification reaction to separation or other detection
means relative to a control but subjected to an amplification reaction using the same
primers as for the digested DNA sample and then subjecting the products of the
amplification reaction to the separation or other detection means, said control comprising
another sub-sample of said sample of genomic DNA not subjected to digestion by the
methylation-sensitive enzyme, wherein the presence of amplification products in enzyme
digested and non-digested samples is indicative of a methylation-sensitive
enzyme-digestion-resistant marker, and wherein the absence and presence of amplification
products in enzyme digested and undigested samples, respectively, is indicative of a
methylation-sensitive enzyme-digestion-sensitive marker and wherein the presence and
absence of amplification products in enzyme digested and undigested samples, respectively,
is indicative of a methylation-sensitive enzyme-digestion-dependent marker.

More particularly, the present invention provides a method for detecting a change in a
cell's or group of cells' developmental state or a cell's or group of cells' exposure to an
internal or external stimulus, said method comprising detecting a change in methylation
profile over time by the method comprising obtaining a sample of genomic DNA from said
cell or group of cells, digesting a sub-sample of said sample of genomic DNA with HpaII
which has a recognition nucleotide sequence corresponding to or within said sites,
subjecting said digested DNA to an amplification reaction using primers comprising a
nucleotide sequence capable of annealing to a non-cleaved form of a HpaII cleavable
nucleotide sequence and subjecting the products of the amplification reaction to separation
and/or detection means relative to a control but subjected to an amplification reaction
using the same primers as for the digested DNA sample and then subjecting the products of
the amplification reaction to the separation or other detection means, said control
comprising another sub-sample of said sample of genomic DNA not subjected to digestion by
HpaII wherein the presence of amplification products in enzyme digested and non-digested
samples is indicative of a HpaII-digestion-resistant marker (H^r), the absence and presence
of amplification products in enzyme digested and undigested samples, respectively, is
indicative of a HpaII-digestion-sensitive marker (H^s) and the presence and absence of
amplification products in enzyme digested and undigested samples, respectively, is
indicative of a HpaII-digestion-dependent marker (H^d).

In a further embodiment, the present invention provides a method for detecting a change in
a cell's or group of cells' physiological status such as from a normal state to a disease
state, said method comprising obtaining a sample of genomic DNA from said cell or group
of cells, digesting a sub-sample of said sample of genomic DNA with a
methylation-sensitive enzyme having a recognition nucleotide sequence comprising a
nucleotide capable of being methylated at one or more of the above-mentioned sites,
subjecting said digested DNA to an amplification reaction using primers comprising a
nucleotide sequence capable of annealing to a non-cleaved form of a cleavable nucleotide
sequence of the methylation-sensitive enzyme recognition sequence and subjecting the
products of the amplification reaction to separation or other detection means relative to a
control but subjected to an amplification reaction using the same primers as for the
digested DNA sample and then subjecting the products of the amplification reaction to the
separation or other detection means, said control comprising another sub-sample of said
sample of genomic DNA not subjected to digestion by the methylation-sensitive enzyme,
wherein the presence of amplification products in enzyme digested and non-digested samples
is indicative of a methylation-sensitive enzyme-digestion-resistant marker, and wherein the
absence and presence of amplification products in enzyme digested and undigested
samples, respectively, is indicative of a methylation-sensitive enzyme-digestion-sensitive
marker and wherein the presence and absence of amplification products in enzyme
digested and undigested samples, respectively, is indicative of a methylation-sensitive
enzyme-digestion-dependent marker.

In yet another embodiment, the present invention provides a method for detecting a change
in a cell's or group of cells' physiological status such as from a normal state to a disease
state, said method comprising detecting a change in methylation profile over time by the
method comprising obtaining a sample of genomic DNA from said cell or group of cells,
digesting a sub-sample of said sample of genomic DNA with HpaII which has a
recognition nucleotide sequence corresponding to or within said sites, subjecting said
digested DNA to an amplification reaction using primers comprising a nucleotide sequence
capable of annealing to a non-cleaved form of a HpaII cleavable nucleotide sequence and
subjecting the products of the amplification reaction to separation or other detection means
relative to a control but subjected to an amplification reaction using the same primers as
for the digested DNA sample and then subjecting the products of the amplification reaction
to the separation or other detection means, said control comprising another sub-sample of
said sample of genomic DNA not subjected to digestion by HpaII wherein the presence of
amplification products in enzyme digested and non-digested samples is indicative of a
HpaII-digestion-resistant marker (H^r), the absence and presence of amplification products
in enzyme digested and undigested samples, respectively, is indicative of a
HpaII-digestion-sensitive marker (H^s) and the presence and absence of amplification
products in enzyme digested and undigested samples, respectively, is indicative of a
HpaII-digestion-dependent marker (H^d).

Preferably, the amplification reaction is PCR.

Disease conditions associated with methylation include but are not limited to Rett
syndrome, ICF and Fragile X syndrome. Furthermore, the change in methylation profile
can correlate with the change in expression patterns of particular genes and, hence, the
assay of the present invention is a useful way of identifying new genes (or promoters or
other genetic sequences) or for functionalizing a known gene or genetic sequence.

A change in methylome profile over time means that at least two time points are measured 
and there is a change in methylation profile over at least the two time points. 
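The two-time-point comparison described above can be sketched in code. The following is a minimal illustration only, assuming each profile is recorded as a mapping from a marker name to its AMP class; the function name `profile_changes` and the marker names `m1` to `m4` are hypothetical and do not come from the specification.

```python
from typing import Dict, Optional, Tuple

# A profile maps a (hypothetical) marker name to its AMP class label.
Profile = Dict[str, str]


def profile_changes(
    earlier: Profile, later: Profile
) -> Dict[str, Tuple[Optional[str], Optional[str]]]:
    """Return markers whose class differs between two time points.

    A marker missing from one profile is reported with None for that
    time point.
    """
    changes = {}
    for marker in sorted(set(earlier) | set(later)):
        before, after = earlier.get(marker), later.get(marker)
        if before != after:
            changes[marker] = (before, after)
    return changes


# Illustrative data only; not measurements from the Examples.
t0 = {"m1": "Class I", "m2": "Class II", "m3": "Class III"}
t1 = {"m1": "Class I", "m2": "Class I", "m4": "Class III"}
print(profile_changes(t0, t1))
```

An empty result would mean that no change in methylome profile was detected between the two time points for the markers scored.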


The present invention further permits the identification of genes and promoters having
CpG or CpNpG islands which have HpaII sites or other methylation-sensitive restriction
sites. Furthermore, application of the AMP technology permits identification of Class III
markers (i.e. H^d sites). The identification of these sites permits identification of
potential regulatory regions which can be targeted for agonists or antagonists of gene
expression.

The present invention further contemplates mapping methylome profiles with phenotypic
and/or genotypic profiles. A database of methylomes can then be produced relative to a
phenotypic trait or a particular genotype. Such a database is useful in bioinformatics to
establish likely phenotypes of, for example, animal, mammalian, avian, insect, reptilian,
amphibian, fungal, yeast, nematode or other cells of a eukaryotic organism. Furthermore,
the database may also be used to map junctions between methylated and non-methylated
DNA and their association with important biological phenotypes. The latter frequently
show high levels of cell specificity. The present invention further contemplates any method
for monitoring these junctions and the association of these junctions with important
biological phenotypes.

Methylation profiles are also important in plant tissue culture and/or in recombinant 
protocols applied to plants. 

Thus, in another aspect, the invention contemplates a computer program product for
assessing the likely phenotype of a cell based on methylome profile, said product
comprising:-

(1) code that receives an input value for one or more features wherein said features
    are selected from:-

    (a) absence or presence of AMP marker Class I;
    (b) absence or presence of AMP marker Class II;
    (c) absence or presence of AMP marker Class III;
    (d) absence or presence of phenotype mapped to a Class I marker;
    (e) absence or presence of phenotype mapped to a Class II marker;
    (f) absence or presence of phenotype mapped to a Class III marker;
    (g) absence or presence of a junction between methylated and unmethylated
        DNA; and
    (h) absence or presence of phenotype mapped to a junction between methylated
        and unmethylated DNA; and




WO 03/025215 PCT/AU02/01262 






(2) a computer readable medium that stores the code. 

Yet another aspect of the invention extends to a computer system for assessing the likely
phenotype of a cell based on methylation profile wherein said computer system
comprises:-

(1) a machine-readable data storage medium comprising a data storage material
    encoded with machine-readable data, wherein said machine-readable data comprise
    values for one or more features, wherein said features are selected from:-

    (a) absence or presence of AMP marker Class I;
    (b) absence or presence of AMP marker Class II;
    (c) absence or presence of AMP marker Class III;
    (d) absence or presence of phenotype mapped to a Class I marker;
    (e) absence or presence of phenotype mapped to a Class II marker;
    (f) absence or presence of phenotype mapped to a Class III marker;
    (g) absence or presence of a junction between methylated and unmethylated
        DNA; and
    (h) absence or presence of phenotype mapped to a junction between methylated
        and unmethylated DNA;

(2) a working memory for storing instructions for processing said machine-readable
    data;

(3) a central-processing unit coupled to said working memory and to said
    machine-readable storage medium, for processing said machine-readable data to provide
    comparison of phenotype and AMP marker classes; and

(4) an output hardware coupled to said central processing unit for receiving said data of
    comparison.
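By way of illustration only, the feature values (a) to (h) could be held as a simple record and two records compared. This is a hypothetical sketch under the assumption that each feature is stored as a presence/absence flag; the names `MethylomeFeatures` and `shared_features` are illustrative and are not defined by the specification.

```python
from dataclasses import dataclass, fields
from typing import List


@dataclass(frozen=True)
class MethylomeFeatures:
    """Presence (True) or absence (False) of features (a) to (h)."""

    class_i_marker: bool          # (a)
    class_ii_marker: bool         # (b)
    class_iii_marker: bool        # (c)
    phenotype_on_class_i: bool    # (d)
    phenotype_on_class_ii: bool   # (e)
    phenotype_on_class_iii: bool  # (f)
    junction: bool                # (g)
    phenotype_on_junction: bool   # (h)


def shared_features(a: MethylomeFeatures, b: MethylomeFeatures) -> List[str]:
    """Return the names of the features that are present in both records."""
    return [
        f.name for f in fields(a) if getattr(a, f.name) and getattr(b, f.name)
    ]


# Illustrative records only; not data from the Examples.
reference = MethylomeFeatures(True, False, True, True, False, True, True, True)
query = MethylomeFeatures(True, True, True, False, False, True, True, False)
print(shared_features(reference, query))
```

A real system would store one such record per marker or per cell type; a single flat record is used here only to keep the comparison step visible.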
The present invention further extends to searching for homologous AMP markers, once
identified. This then provides a basis for identifying similar control mechanisms in other
genes, promoters or regulatory regions. For example, tissue-specific AMP markers or
AMP markers associated with a disease are cloned, sequenced and mapped on the genome.
Coding sequences adjacent or linked to the mapped AMP marker are identified. Genome-wide
microarray analysis is conducted with cDNA probes from different tissues or from
normal versus diseased cells to reveal other coding sequences that are coordinately
regulated with the coding sequence linked to the said tissue or disease-specific AMP
marker. Computer algorithms are then used to identify conserved nucleotide signatures
common to all coordinately expressed genes, which in turn are correlated to the
methylation status of an AMP marker in the vicinity of each coding sequence. The
discovered nucleotide signature can then be used to discover further coordinately regulated
coding sequences around the genome regulated by methylation status of an adjacent AMP
marker.

A combined AMP analysis approach, or AMP analysis on individuals or tissues of
individuals, is adopted to identify DNA methylation markers linked or associated with
important phenotypes such as disease, a diseased tissue (e.g. cancer), aging or exposure to
a toxin (e.g. excessive alcohol or smoke). For the combined AMP analysis approach, DNA is
bulked for normal versus affected individuals or tissues. AMP analysis and comparison
between normal and affected DNA bulks identifies candidate diagnostic markers
associated with the phenotype. Promising markers identified by this approach are then
tested on a large number of normal versus affected individuals or tissues. When a marker is
identified to be always or almost always or very often associated with the phenotype, the
marker will be cloned and sequenced, and a diagnostic test developed. The AMP is also
mapped on the genome to identify coding or non-coding genes linked to the methylation
change in affected individuals or tissues. These linked coding or non-coding genes or the
encoded proteins are identified as therapeutic targets for the development of drugs or
therapy.

In cases where the gene is methylated and silenced in affected individuals or tissues,
compounds are screened in high throughput fashion in stable cell lines or individuals to
identify drugs that result in demethylation and reactivation of the affected gene.
Alternatively, a normal active copy of the affected gene is transfected as a transgene into
cells to correct the defect. Such transgenes are introduced with modulating sequences that
protect the transgene from methylation and keep it unmethylated and transcriptionally
active.

In cases where the gene is unmethylated and transcriptionally active or transcriptionally
over-active in affected individuals or tissues, compounds are screened in high throughput
fashion in stable cell lines to identify drugs that result in methylation and silencing of the
affected gene. Alternatively, a transgene encoding a double stranded RNA homologous to
the affected AMP or homologous to sequences linked to the AMP, or double stranded
RNA homologous to the AMP or homologous to sequences linked to the AMP, is
transfected into cells to methylate the gene, silence it and thereby correct the defect. Such
double stranded RNA-encoding transgenes are introduced with modulating sequences which
protect them from methylation and keep them transcriptionally active and producing
double stranded RNA.

The present invention is further described by the following non-limiting Examples.

EXAMPLE 1
AMP protocol

The following protocol may need to be adapted for use in particular plant or animal cells.
However, in general, the procedure involves isolating genomic DNA, subjecting the DNA
to HpaII digestion and subjecting the digest to an amplification reaction using a primer
which spans all or part of the HpaII recognition sequence, i.e. CCGG, such that if HpaII
has digested the nucleotide sequence, primer extension cannot occur.

A diagram showing the AMP protocol is shown in Figure 1.

Three classes of product are expected:-

Class I:    HpaII digestion-resistant (H^r) markers, indicative of methylation.

Class II:   HpaII digestion-sensitive (H^s) markers, indicative of non-methylation.

Class III:  HpaII digestion-dependent (H^d) markers. The amplicon is methylated or
            partly methylated and is linked to an unmethylated HpaII site
            followed by an inhibitor sequence. Cleavage of the unmethylated site
            removes inhibition and allows amplification.

The existence of an H^d (Class III) marker is further shown in Figure 2. An H^d marker
represents a junction between methylated and unmethylated DNA. Figure 3 provides an
example of the different classes of AMP markers.
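The scoring rule behind the three classes can be written as a small decision function. The sketch below is an illustration rather than part of the protocol: it assumes a band is scored simply as present or absent in the digested and undigested lanes, and the names `MarkerClass` and `classify_marker` are hypothetical.

```python
from enum import Enum


class MarkerClass(Enum):
    RESISTANT = "Class I (digestion-resistant)"
    SENSITIVE = "Class II (digestion-sensitive)"
    DEPENDENT = "Class III (digestion-dependent)"
    NO_PRODUCT = "no product in either sample"


def classify_marker(in_digested: bool, in_undigested: bool) -> MarkerClass:
    """Classify a band by its presence in digested and undigested template."""
    if in_digested and in_undigested:
        return MarkerClass.RESISTANT   # product survives digestion
    if in_undigested:
        return MarkerClass.SENSITIVE   # product lost after digestion
    if in_digested:
        return MarkerClass.DEPENDENT   # product appears only after digestion
    return MarkerClass.NO_PRODUCT      # nothing to score


print(classify_marker(in_digested=True, in_undigested=False).value)
```

The fourth outcome, no product in either sample, is not one of the three classes; it is included only so that the function is defined for every input.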

The following protocol is useful for most cells such as from humans, sugarcane and wheat.

The DNA marker protocols are based on arbitrarily-primed PCR, using high
concentrations (5 µM) of small primers, ten nucleotides in length. PCR is performed with a
hot start (85°C), an initial denaturation at 94°C for 5 min, followed by 30 cycles of: 94°C
for 30 sec, 60 sec at each of 57°C, 56°C, 55°C, 54°C and 53°C and a final extension step
at 72°C for 5 min. Highly reproducible RAFs were amplified from either purified DNA or
alkali treated leaf tissue (Klimyuk et al, Plant J. 3: 493-494, 1993) to demonstrate DNA
sequence change between sugarcane lines. The amplified DNA methylation
polymorphisms (AMP) protocol is similar to the RAF protocol, but it uses genomic DNA
digested with methylation-sensitive enzymes before it is used as template in PCR using
oligonucleotide primers with recognition sequences for the methylation-sensitive enzyme.

The protocol useful for eukaryotic cells such as from humans, sugarcane or wheat is as
follows.

The method is a PCR-based DNA marker protocol for detecting AMP markers in genomic
DNA. AMP markers are detected by using oligonucleotide primers carrying recognition
sequences for methylation-sensitive restriction endonucleases on genomic template
digested with the corresponding enzyme. The procedure listed below is for use of the
methylation-sensitive enzyme HpaII and corresponding HpaII oligonucleotide primers, but
the protocol can be modified accordingly for all other methylation-sensitive enzymes.

Template preparation:

1. Digest 5.0 µg of high quality genomic DNA with the methylation-sensitive enzyme
   (e.g. HpaII) [overnight digestion is convenient].

   Digest Reaction (100 µL)

   10x enzyme buffer      10 µL
   HpaII (10 U/µL)        5 µL
   DNA                    5 µg
   sterile dH2O           to final vol. 100 µL

2. Run 250 ng (5 µL) of digest on 0.7% w/v agarose gel to check digestion.

3. Remove 250 ng (5 µL) and keep for later (to compare with recovery after clean-up).

4. Precipitate digested DNA with 0.1 vol. 3 M Na acetate (pH 6), 2 vol. 100% v/v
   EtOH, 1 µL (20 µg/µL) glycogen. Mix gently, leave on bench for 10 to 15 mins.

5. Microfuge at 4°C (13000 rpm) for 30 mins. Immediately after spinning, remove the
   supernatant and wash pellet with 70% v/v EtOH. Remove EtOH and dry pellet (2
   mins in Speedie Vac or air-dry at room temp in laminar flow).

6. Resuspend pellet in 50 µL sterile dH2O (ca. 100 ng/µL). To check recovery of the
   clean-up run 2.5 µL (ca. 250 ng) on gel alongside the 250 ng kept from step 3.
   Estimate recovery and dilute samples to 50 ng/µL for use in the AMP reaction.
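The concentrations quoted in the steps above follow from simple mass/volume arithmetic, which the sketch below reproduces. It is an illustration only; the function names are hypothetical, and the 20 µL working volume in the usage lines is an assumed value, not one given in the protocol.

```python
from typing import Tuple


def concentration_ng_per_ul(mass_ng: float, volume_ul: float) -> float:
    """Concentration of DNA in ng/uL for a given mass and volume."""
    return mass_ng / volume_ul


def dilution_volumes(
    stock_ng_per_ul: float, target_ng_per_ul: float, final_volume_ul: float
) -> Tuple[float, float]:
    """Return (stock volume, water volume) in uL to reach the target."""
    if target_ng_per_ul > stock_ng_per_ul:
        raise ValueError("target concentration exceeds stock concentration")
    stock_ul = target_ng_per_ul * final_volume_ul / stock_ng_per_ul
    return stock_ul, final_volume_ul - stock_ul


# Step 1: 5 ug (5000 ng) of DNA in a 100 uL digest is 50 ng/uL,
# so the 5 uL taken in steps 2 and 3 each contain 250 ng.
digest = concentration_ng_per_ul(5000, 100)
print(digest, digest * 5)

# Step 6: diluting a ca. 100 ng/uL resuspension to 50 ng/uL,
# for an assumed working volume of 20 uL.
print(dilution_volumes(100, 50, 20))
```

If the gel check in step 6 suggests a lower recovery, the estimated concentration would be passed as `stock_ng_per_ul` in place of 100.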



A PCR procedure is then conducted using the following primers:-

HpaII Oligonucleotide Primers (Operon)

Oligo       5'-3'         SEQ ID NO:
B-16        TTTGCCCGGA    1
C-08        TGGACCGGTG    2
D-20        ACCCGGTCAC    3
F-17        AACCCGGGAA    4
F-18        TTCCCGGGTT    5
1-08        TTTGCCCGGT    6
J-01        CCCGGCATAA    7
J-14        CACCCGGATG    8
M-17        TCAGTCCGGG    9
N-09        TGCCGGCTTG    10
P-05        CCCCGGTAAC    11
v - 1       CAGTGCCGGT    12
V X /       ACCGGCTTGT    13
VV UJ       GTCCGGAGTG    14
W-1 S       ACACCGGAAC    15
AFU1 6      CCCGGATGGT    16
AF-11       AAGACCGGGA    17
AF-16       TCCCGGTGAG    18
AT-1 S      GAATCCGGCA    19
AK"-1R      ACCCGGAAAC    20
AM-09       TGCCGGTTCA    21
AN-14       AGCCGGGTAA    22
AP-20       CCCGGATACA    23
AQ-16       CCCGGAAGAG    24
AR-18       CTACCGGCAC    25
AT-10       ACCTCCGGTC    26
AV-14       CTCCGGATCA    27
AY-03       TTTCCGGGAG    28
BB-09       AGGCCGGTCA    29
BB-18       CAACCGGTCT    30
BF-13       CCGCCGGTAA    31
BG-17       TCCGGGACTC    32


Useful PCR stock solutions include:-

1. 10x DAF buffer = 100 mM Tris pH 8, 100 mM KCl, 50 mM MgCl2;

2. 1 mM dNTPs;

3. α-labelled 33P-dATP, 10 µCi/µL (as from supplier); and

4. 10 µM 10-mer primer stocks.

The PCR procedure is conducted as follows:-

PCR is conducted in a 10 µL volume, with a single oligonucleotide primer, 10 nucleotides
long (see above).

Reaction Mix                       1 reaction   10 reactions   Final concentration
Stoffel fragment (10 U/µL)         0.15 µL      1.5 µL         1.5 U/10 µL
10x DAF buffer                     1.0 µL       10.0 µL
dNTPs (1 mM)                       0.2 µL       2.0 µL         20 µM
Sterile dH2O                       2.4 µL       24.0 µL
α-labelled 33P-dATP (10 µCi/µL)    0.25 µL      2.5 µL         2.5 µCi/10 µL
Total                              4.0 µL       40.0 µL

Aliquot 4 µL of reaction mix into each PCR tube, then add 5 µL primer (final
concentration 5 µM) and 1 µL of 50 ng/µL digested or genomic DNA.
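The reaction-mix volumes scale linearly with the number of reactions, and the final concentrations follow from the 10 µL reaction volume. The sketch below reproduces that arithmetic as an illustration; the dictionary keys and function names are hypothetical labels, and no pipetting overage is added because the source does not specify one.

```python
from typing import Dict

REACTION_VOLUME_UL = 10.0  # 4 uL mix + 5 uL primer + 1 uL template

# Per-reaction volumes (uL) taken from the reaction mix described above.
PER_REACTION_UL: Dict[str, float] = {
    "Stoffel fragment (10 U/uL)": 0.15,
    "10x DAF buffer": 1.0,
    "dNTPs (1 mM)": 0.2,
    "sterile dH2O": 2.4,
    "alpha-33P-dATP (10 uCi/uL)": 0.25,
}


def master_mix(n_reactions: int) -> Dict[str, float]:
    """Scale each component volume to the requested number of reactions."""
    return {
        name: round(volume * n_reactions, 2)
        for name, volume in PER_REACTION_UL.items()
    }


def final_concentration(
    stock: float, volume_added_ul: float, total_ul: float = REACTION_VOLUME_UL
) -> float:
    """Final concentration after dilution, in the same units as the stock."""
    return stock * volume_added_ul / total_ul


print(master_mix(10))
print(final_concentration(1000, 0.2))  # dNTPs: 1 mM = 1000 uM stock
```

The same `final_concentration` call with the template (50 ng/µL, 1 µL added) gives 5 ng/µL, i.e. 50 ng of DNA per 10 µL reaction.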

Prepare reactions on ice and load tubes into the PCR machine with the block at 85°C (i.e.
hot start).

The following conditions are employed:-

1.  94°C  5 min
2.  94°C  30 sec
3.  57°C  1 min
4.  56°C  1 min
5.  55°C  1 min
6.  54°C  1 min
7.  53°C  1 min
8.  go to step (2) 29 more times
9.  72°C  5 min
10. end.



The protocol was slightly modified for the large wheat genome, in that the
annealing/extension temperatures were 2°C higher, i.e. 94°C for 5 min followed by
30 cycles of:

94°C for 30 sec
59°C for 1 min
58°C for 1 min
57°C for 1 min
56°C for 1 min
55°C for 1 min.
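The standard and wheat cycling conditions differ only in the starting annealing/extension temperature, so both can be generated from one routine. This is a minimal sketch, not instrument code: the function names are hypothetical, and it assumes the wheat programme keeps the same 72°C final extension as the standard programme, which the text does not state explicitly.

```python
from typing import List, Tuple

# (label, temperature in deg C, hold time in seconds)
Step = Tuple[str, int, int]


def amp_cycling_program(first_anneal_c: int = 57, cycles: int = 30) -> List[Step]:
    """Build the stepped annealing/extension programme as a flat step list."""
    program: List[Step] = [("initial denaturation", 94, 300)]
    one_cycle: List[Step] = [("denaturation", 94, 30)]
    # Five 1-min holds, each 1 deg C lower than the previous one.
    one_cycle += [
        ("annealing/extension", first_anneal_c - i, 60) for i in range(5)
    ]
    program += one_cycle * cycles
    program.append(("final extension", 72, 300))  # assumed for wheat as well
    return program


def total_hold_minutes(program: List[Step]) -> float:
    """Sum of programmed hold times in minutes, excluding ramp times."""
    return sum(seconds for _, _, seconds in program) / 60


standard = amp_cycling_program()
wheat = amp_cycling_program(first_anneal_c=59)
print(total_hold_minutes(standard))
print(sorted({t for label, t, _ in wheat if label == "annealing/extension"}))
```

The hold-time total ignores ramping between temperatures, so the actual run time on a given thermal cycler would be longer.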



Furthermore, the following changes were also made:-

1. Add 10 µL of 2x loading buffer (0.0% w/v Bromophenol Blue, 0.05% Xylene
   Cyanol, 98% v/v formamide, 10 mM EDTA, pH 8) to each PCR sample, mix
   gently. Before gel electrophoresis, denature PCR reactions at 90°C for 3 mins, cool
   on ice.

2. Separate the products on sequencing gels (4% w/v polyacrylamide gel, 7.5 M urea
   in 1x TBE, run at 100 W, 2 hr 15 mins). Dry gel and expose to film for 6 hours or
   overnight.

The extent of DNA methylation change was determined in 24 sugarcane lines cloned by
tissue culture. All 24 lines investigated had widespread DNA methylation changes and
unique AMP profiles. An average of 0.8% DNA methylation polymorphism over more
than 1,000 markers was observed between the parent variety and somaclones and both
increases and decreases in DNA methylation were observed. DNA methylation
polymorphism was also demonstrated in conventionally propagated plants, but the level
was about two-fold greater (p < 0.05) in tissue culture propagated plants.

Clearly, DNA methylation change predisposed sugarcane clones to genome instability as
about 20% of non-transgenic somaclones lost DNA as measured by randomly amplified
DNA fingerprinting protocols (RAFs) and this finding was confirmed with simple
sequence repeat (SSR) markers. Up to 2% of RAF markers were lost in a selection of
transgenic lines of sugarcane. Genetic mapping of DNA markers susceptible to loss
showed that they were often deleted as a linkage group. The inventors detected an example
of chromosome breakage as distinct from chromosome loss in one transgenic line.
Interestingly, there was a significant correlation between loss of markers and reduced yield
in preliminary field trials on transgenic lines (p = 0.017).

This example showed that a minimal passage through tissue culture altered the methylation
status of about 1% of the sugarcane genome, as randomly detected by the AMP
technology. Transposon activity can lead to chromosome loss and/or breakage (e.g., see
English et al, Plant Cell 5: 501-514, 1993). Thus, the increased DNA methylation change
in somaclones as shown here may have direct effects on gene expression as well as
predisposing the sugarcane genome to instability. The AMP technologies in accordance
with the present invention may usefully be applied as diagnostic tests to identify tissue
culture conditions that minimize DNA methylation change and loss of DNA during the
tissue culture phase of genetic engineering of plants and animals.



EXAMPLE 2
Detection of DNA polymorphisms in human body fluid

Using the AMP technology, abundant and distinct tissue-specific DNA methylation
polymorphisms were readily detected between human blood and sperm (Figure 4). Most of
the tissue-specific DNA methylation polymorphisms were Class III digestion-dependent
markers and, surprisingly, 30% of these were polymorphic in methylation status between
these tissues. Between five blood samples and three sperm samples (from eight separate
individuals), 80% of variation in AMP profiles generated from HpaII-digested template
could be explained by the tissue type, and the remaining 20% of variation could not be
distinguished from nucleotide (rather than DNA methylation) polymorphism.

In another example, blood from three sets of young, healthy, identical twins was also
assessed for DNA methylation polymorphisms over 8,000 CpG sites in the genome,
representing an average of about 340 CpG sites per human chromosome. There was 100%
concordance of AMP profiles of identical twins thereby indicating stringent genetic
programming of DNA methylation patterns in human cells to produce cell-specific
"methylomes". Interestingly, an AMP marker was detected that showed an age-dependent
methylation change. This has implications in targets for therapy of age-related conditions
or illnesses and in forensic medicine.

EXAMPLE 3 
Identification of Class III markers in human genome 

Figure 5 is a diagrammatic representation showing the map location of four human
digestion-dependent Class III AMP markers and the flanking HpaII sites. Each of the
digestion-dependent markers that was cloned and sequenced was 100% homologous to
sequenced regions of the human genome. The positions of the Class III amplicons (HdM 1-4)
[SEQ ID NOS:33-36] are indicated by thick shaded boxes on the chromosomal segment. The
locations of HpaII sites are represented by vertical lines and the distance to, and
orientation of, nearby genes are indicated. Southern analysis has shown that Class III
markers represent the junction between methylated and unmethylated DNA in the genome.
Tissue-specific Class III AMP markers often map to the CpG islands and often represent
tissue-specific modification of methylation in or around CpG islands.

EXAMPLE 4
Identification of promoter elements that contain CpG islands in human genomic DNA

Human promoter elements that contain CpG islands are identified by searching the human
genome database and selecting promoter element sequences that comprise regions of DNA
which are approximately 180 bp in length and that have a sequence complexity comprising
a G+C content above 50%. An example of such a promoter element comprising a CpG
island is the rpA2 promoter element (shown below):-

caggcgcccg ccaccacacc cagctaattt tttatatttt tagtagagat ggggtttcac 60 
cgtgttagcc agcatggtct tgatctcctg atctcgtgat cagggaaatc tgtttaatta 120 
aaacacgctt tgcgatttct gcccagtgct ctgaatgtca aagtcaagag taggaggaag 180 



ggtgtctatc ttacttaacc attgaatccc agtggccagc acagggccag gaacaaagaa 600
ggcataaaca attctttttt tttttttttt ttttttttga gtcgcagtct ctctctgcca 660
cctaggctgg agtgcagtgg cagcatcaca gctcactgca gcctggacac cccagggcca 720
ggtgatcctc ccaccttagc caccccagta gctgggaccc caggcacttg ccaccacacc 780
agactaattt ttaaaaatat tttttgagag agggtctcac tatgttgtcc aggctggtct 840
caaacttcca gcctcaagcg gtcctcctgc ctcagacccc atttgctggg tttacaggca 900
tgagccacag cacctgctaa tttttcttaa atacataaat gaacataaaa ttctaacaat 960
gcatgagtat tttgaggaag gaactgacaa aatgttccac tccctatggg aggcacgtta 1020
tatgaagaat tatgaaaaat ggtcgaaatg actggagagg ccaagcctgg atgagactgg 1080
gatggggaca ggtgcgggac gaggggcacc accctcacat ctttcacaag tctgtcatag 1140
gcaagagggc gtaggtttct cacagcccca ctggggagaa tcggcaccat tggtggcatt 1200
acacgaagag aatgtgacct cctatgtaaa agaacaagca actccacgcg gtgctgtgag 1260
gctagtgctg cgagtccctg aggtgcgcaa ttcccgcacg accgtgggtg ggaaacaccg 1320
aagccaaaac tccgctacag ccctttagat gaaggcgtcg tctgattggt gatagtttgg 1380
cgcgaacctg agcacgccga acaaaggaag tgacggcaga agtcgcgcac ttgacgaggg 1440
tgggatcaca cggcgctgcg tcgcggtagt attgttctga ttggttgatt tcttgcgata 1500
ccgctctgcc agccccttgc ttccgctagt gcggagggtt ttgcccttcg taaagatggc 1560
cgcggaggct tttggagcca actgggagcg cagtacgcgt tttctggagc atgggcagag 1620
gagacaggaa caagcgtagc atccgtgagc accgattggc tgaagcgagc accccgggag 1680
ctgactggct ccgccattcg cgggaaggcg tttgtggtgc cagagaaaag tagccagagc 1740
ggcgcagtgg 1750
[SEQ ID NO: 37]

The CpG islands present in the promoter element are highlighted. The rpA2 promoter
element contains two CpG islands. The presence of CpG islands in this promoter
element indicates that the expression of the gene is regulated by the methylation status
of the promoter element.
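
By way of illustration only, the selection criterion described in this Example (a region of about 180 bp having a G+C content above 50%) may be sketched as a sliding-window scan. The function names and the merging of overlapping windows are assumptions made for the sketch; the specification does not describe a particular search algorithm, and other CpG island definitions apply additional criteria that are not modelled here.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in seq (case-insensitive); 0.0 if seq is empty."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def gc_rich_windows(seq: str, window: int = 180, threshold: float = 0.5):
    """Return merged half-open (start, end) intervals covered by windows of
    length `window` whose G+C fraction is strictly above `threshold`."""
    seq = seq.upper()
    n = len(seq)
    if n < window:
        return []
    is_gc = [1 if base in "GC" else 0 for base in seq]
    count = sum(is_gc[:window])  # G+C count of the first window
    regions = []
    for start in range(n - window + 1):
        if start > 0:
            # Slide by one base: add the entering base, drop the leaving base.
            count += is_gc[start + window - 1] - is_gc[start - 1]
        if count / window > threshold:
            end = start + window
            if regions and start <= regions[-1][1]:
                regions[-1] = (regions[-1][0], end)  # extend the previous region
            else:
                regions.append((start, end))
    return regions


# Synthetic test sequence: a 200 bp GC-only block flanked by AT-only DNA.
demo = "AT" * 100 + "GC" * 100 + "AT" * 100
print(gc_rich_windows(demo))
```

Positions are 0-based, so a reported interval `(start, end)` corresponds to bases `start + 1` to `end` in the numbering used for the sequence listings.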

EXAMPLE 5 

Detection of promoter elements that contain CpG islands in plant genomic DNA 

A plant promoter element that contains CpG islands is identified by searching plant
genome databases and selecting promoter element sequences that comprise regions of
DNA which are at least 180 bp in length and that have a sequence complexity comprising a
G+C content above 50%. An example of such a promoter element comprising a CpG
island is the rbp21 promoter element derived from the rbp21 gene from Oryza sativa
(shown below):





ctgcagagag gatgaccctt gtcaccaccg tcatgtacga ggctgcttca ccactgcctc 60
actgccacca gcgtctcccg ccgcgtgcaa tacaagaaga aacatcgaac ggtcatataa 120
ggtaagaccc actaccgatt taacctatca ttcccacaat ctaatccact tatttctctt 180
cccatgatct tatcctctca tttctcctca ctacttttgc atttgtagga aacacaatga 240
caccgtcgaa gaaagctggt ggagcaccgt agccagcaat caccaaaaca cagaggggag 300
gaggtcggca gcggccatgc ggacggcgat gagacaacgc gacgcaaaga gggaggagga 360
cgttggcgat catgctggtg ttggcggagg aggtcactgg ccatgcgaat gacagcgggg 420
cagcgcaaca caaaaagggg ggaggatgcc ggcgaccacg ctagtaccat gaagcaagat 480
gatgtgaaag ggaggaccgg acgagggttg gacctctgcc gccgacgtga agagcgtgat 540
gtgtagaagg agatgttaga ccagatgccg acgcaactta gccctgcaag tcacccgact 600
gcatatcgct gcttgccctc gtcctcatgt acacaatcag cttgcttatc tctccatact 660
tgtcgtttgt ttcccgtggc cgaaatagaa gaagacagag gtgggttttg ttggagagtt 720
ttagtggtat tgtaggccta tttgtaattt tgttgtactt tattgtatta atcaataaag 780
gtgtttcatt ctattttgac tcaatgttga atccattgat ctcttggtgt tgcactcagt 840
3.t y 1 1 dyadt attcatccey ttgaaacaat cctggctaag ggttggaaca cttttatctg 900
ttcggtgaaa catccgtaat attttcgttg aaacaatttt tatccgacag caccgtccaa 960
caatttacac caatttggac gtgtgataca tagcagtccc caagtgaaac tgaccaccag 1020
ttgaaaggta tacaaagtga acttattcat ctaaaagacc gcagagatgg gccgtggccg 1080
tggctgcgaa acgacagcgt tcaggcccat gagccattta ttttttaaaa aaatatttca 1140
acaaaaaaga gaacggataa aatccatcga aaaaaaaaaa ctttcctacg catcctctcc 1200
tatctccatc cacggcgagc actcatccaa accgtccatc cacgcgcaca gtacacacac 1260
atagttatcg tctctccccc cgatgagtca ccacccgtgt cttcgagaaa cgcctcgcgc 1320
gacaccgtac gtgcgccacc gccgcgcctg ccgcctggac acgtccggct cctctcccgc 1380
cgcgctggcc accgtccacc ggctcccgca cacgtctccc tgtctccctc cacccatgcc 1440
gtggcaatcg agctcatctc ctcgcctcct ccggcttata aatggcggcc accaccttca 1500
cctgcttgca
[SEQ ID NO: 38]











The CpG islands present in the promoter element are highlighted. The rbp21 promoter
element contains two CpG islands. The presence of a CpG island indicates that the activity
of the promoter element is regulated by methylation.

EXAMPLE 6 
Use of AMP to assess stable mammalian cell lines 

Mammalian cell lines are used for many applications in medical research. In many of
these applications it is important that the cell lines used are stable in their
phenotypes. AMP is used to distinguish stable from unstable mammalian cell lines. AMP
has been used to demonstrate that subcultures of the human colon carcinoma cell line
HCT116 are very stable and identical in terms of DNA methylation and DNA sequence
(Figure 8). Similarly, AMP can be used to monitor DNA methylation and genome stability
in cloned animals.

EXAMPLE 7 
Use of AMP to assess stable cell lines 

Cell lines that are stable in terms of DNA methylation and DNA sequence can be used
with the AMP protocol to identify and discover genes involved in control of DNA
methylation and genome stability, or to identify processes that destabilize DNA
methylation and genome integrity. AMP has been used to demonstrate that knocking out
the tumor suppressor gene p53 in the human colon carcinoma cell line HCT116 results in
about 3% DNA methylation change and 0.3% DNA sequence change to the genome as measured
by AMP (Figure 9). These genomic changes could be due to loss of p53 function
(indicating a key role for p53 in control of DNA methylation and maintenance of genome
stability) or to the physical process of transfecting the cell line. Thus AMP can be
used to identify and discover genes involved in control of DNA methylation and genome
stability, and to identify processes or toxins that destabilize DNA methylation and
genome integrity in mammals.

EXAMPLE 8 

Detection of DNA methylation polymorphisms in genomic DNA 

Southern analysis confirms that AMP detects DNA methylation polymorphisms in
genomic DNA and the junctions of methylated and unmethylated DNA in the genome
(Figure 10).

EXAMPLE 9 
Combined AMP analysis 

DNA from normal and abnormal tissue (e.g. cancer) is separately bulked. AMP analysis is
conducted to compare DNA from the normal and abnormal tissue. Candidate fragments
present or absent in abnormal tissue relative to normal tissue are then cloned and analyzed
for subsequent development as a potential diagnostic marker for disease and/or as a
potential therapeutic product itself or as a target for antagonists and agonists. In
addition, methylation- or demethylation-facilitating sequences may be used to methylate
or demethylate target DNA.
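
By way of illustration only, the selection of candidate fragments that are present or absent in abnormal tissue relative to normal tissue may be sketched as a set comparison. The sketch assumes, hypothetically, that each bulked sample is summarized as the set of band sizes (in bp) scored on a gel; the band sizes and the `candidate_fragments` function are illustrative and do not come from the specification.

```python
def candidate_fragments(normal: set[int], abnormal: set[int]) -> dict[str, list[int]]:
    """Split band sizes into those gained and those lost in abnormal tissue."""
    return {
        "gained_in_abnormal": sorted(abnormal - normal),
        "lost_in_abnormal": sorted(normal - abnormal),
    }


# Hypothetical band sizes (bp) for illustration only.
normal_bands = {210, 340, 515, 780}
abnormal_bands = {210, 515, 640, 780, 905}

result = candidate_fragments(normal_bands, abnormal_bands)
print(result)
```

In practice bands of similar size would need to be matched within a tolerance before comparison; exact integer matching is used here only to keep the sketch short.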
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EXAMPLE 10 

Use of subtractive hybridization to enrich for HpaII-dependent markers

When comparing genomes subjected to AMP analysis, given the large number of bands
which are present, it is possible that HpaII digestion-dependent markers are masked by
other bands. Subtractive amplification is used to increase the number of HpaII
digestion-dependent PCR products detected in each PCR.

The PCR-Select cDNA subtraction kit (Clontech, Palo Alto, USA) is conveniently used for
the subtractive hybridization.

Those skilled in the art will appreciate that the invention described herein is susceptible to
variations and modifications other than those specifically described. It is to be understood
that the invention includes all such variations and modifications. The invention also
includes all of the steps, features, compositions and compounds referred to or indicated in
this specification, individually or collectively, and any and all combinations of any two or
more of said steps or features.

CLAIMS

1. A method for determining the methylation profile of one or more nucleotides at
one or more sites within the genome of a eukaryotic cell or group of cells, said method
comprising obtaining a sample of genomic DNA from said cell or group of cells, digesting 
a sub-sample of said sample of genomic DNA with a methylation-sensitive enzyme having 
a recognition nucleotide sequence comprising a nucleotide sequence capable of 
methylation at one or more of the above-mentioned sites, subjecting said digested DNA to 
an amplification reaction using primers comprising a nucleotide sequence capable of 
annealing to a non-cleaved form of a cleavable nucleotide sequence of the methylation- 
sensitive enzyme recognition sequence and subjecting the products of the amplification to 
separation or other detection means relative to a control, said control comprising another 
sub-sample of said sample of genomic DNA not subjected to digestion by the methylation- 
sensitive enzyme but subjected to an amplification reaction using the same primers as for 
the digested DNA sample and then subjecting the products of this amplification reaction to 
said separation or other detection means, wherein the presence of the same amplification 
products in enzyme digested and non-digested samples is indicative of a methylation- 
sensitive enzyme-digestion-resistant marker, the absence and presence of amplification 
products in enzyme digested and undigested samples, respectively, is indicative of a 
methylation-sensitive enzyme-digestion-sensitive marker, and the presence and absence
of amplification products in enzyme digested and undigested samples, respectively, is 
indicative of a methylation-sensitive enzyme-digestion-dependent marker. 
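
By way of illustration only, the three marker types recited above follow from whether an amplification product is observed in the digested and in the undigested sub-sample. The sketch below encodes that decision; the `MarkerClass` and `classify_marker` names are hypothetical and are not part of the specification.

```python
from enum import Enum
from typing import Optional


class MarkerClass(Enum):
    RESISTANT = "digestion-resistant"
    SENSITIVE = "digestion-sensitive"
    DEPENDENT = "digestion-dependent"


def classify_marker(in_digested: bool, in_undigested: bool) -> Optional[MarkerClass]:
    """Classify one amplification product from its presence in each sub-sample."""
    if in_digested and in_undigested:
        return MarkerClass.RESISTANT   # same product with and without digestion
    if in_undigested:
        return MarkerClass.SENSITIVE   # product is lost after digestion
    if in_digested:
        return MarkerClass.DEPENDENT   # product appears only after digestion
    return None                        # no product in either sample: not a marker


for digested, undigested in [(True, True), (False, True), (True, False)]:
    print(digested, undigested, classify_marker(digested, undigested).value)
```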

2. The method of Claim 1 wherein the junctions of methylated and unmethylated 
DNA are detected. 

3. The method of Claim 1 wherein the potentially-methylated sites are CpG or 
CpNpG nucleotides within the recognition sequence of the methylation-sensitive enzyme. 

4. The method of Claim 1 or 2 or 3 wherein the methylation-sensitive enzyme is
selected from the list comprising AatII, AciI, AclI, AgeI, AscI, AvaI, BamHI, BsaAI,
BsaHI, BsiE, BsiW, BsrF, BssHII, BstBI, BstUI, ClaI, EagI, HaeII, HgaI, HhaI, HinPI,
HpaII, MloI, MspI, NaeI, NarI, NotI, NruI and PmlI.

5. The method of Claim 4 wherein the methylation-sensitive enzyme is HpaII or
its functional equivalent.

6. The method of Claim 5 wherein the methylation-sensitive enzyme is HpaII.

7. The method of Claim 1 wherein the amplification reaction is selected from
polymerase chain reaction (PCR), ligase chain reaction (LCR), gap filling LCR (GLCR),
Qβ replicase, strand displacement amplification (SDA), self-sustained sequence replication
(3SR) and nucleic acid sequence-based amplification (NASBA).

8. The method of Claim 7 wherein the amplification reaction is PCR. 

9. The method of Claim 8 wherein the amplification reaction is real-time PCR. 

10. The method of Claim 1 wherein the separation means is gel electrophoresis. 

11. The method of Claim 1 wherein the eukaryotic cell is a human or mammalian 
embryonic stem (ES) cell, embryonic germ (EG) cell, post-natal stem cell, committed cell 
or mature cell or a cell having a state of development in between any of the above cells. 

12. The method of Claim 1 wherein the eukaryotic cell is a cancer cell or cancer 
cell line. 

13. The method of Claim 1 wherein the cell is a plant cell.

14. The method of Claim 1 wherein the cell is a non-mammalian animal cell. 

15. The method of Claim 1 wherein the cell is a mammalian cell.

16. A method for determining the methylation profile of one or more CpG or
CpNpG nucleotides at one or more sites within the genome of a eukaryotic cell or group of
cells, said method comprising obtaining a sample of genomic DNA from said cell,
digesting a sub-sample of said sample of genomic DNA with HpaII which has a
recognition nucleotide sequence corresponding to or within said sites, subjecting said
digested DNA to an amplification reaction using primers comprising a nucleotide sequence
capable of annealing to a non-cleaved form of a HpaII cleavable nucleotide sequence and
subjecting the products of the amplification reaction to separation or other detection means
relative to a control but subjected to an amplification reaction using the same primers as
for the digested DNA sample and then subjecting the products of the amplification reaction
to the separation or other detection means, said control comprising another sub-sample of
said sample of genomic DNA not subjected to digestion by HpaII, wherein the presence of
amplification products in enzyme digested and non-digested samples is indicative of a
HpaII-digestion-resistant marker (Hr), the absence and presence of amplification products
in enzyme digested and undigested samples, respectively, is indicative of a
HpaII-digestion-sensitive marker (Hs) and the presence and absence of amplification
products in enzyme digested and undigested samples, respectively, is indicative of a
HpaII-digestion-dependent marker (Hd).



17. The method of Claim 16 wherein the junctions of methylated and unmethylated
DNA are detected.

18. The method of Claim 16 or 17 wherein the amplification reaction is PCR.

19. The method of Claim 18 wherein the amplification reaction is real-time PCR.

20. The method of Claim 16 wherein the eukaryotic cell is a human or mammalian
embryonic stem (ES) cell, embryonic germ (EG) cell, post-natal stem cell, committed cell
or mature cell or a cell having a state of development in between any of the above cells.

21. The method of Claim 16 wherein the eukaryotic cell is a cancer cell or a cancer
cell line.

22. The method of Claim 16 wherein the cell is a plant cell. 

23. The method of Claim 16 wherein the cell is a non-mammalian animal cell. 

24. The method of Claim 14 wherein the cell is a mammalian cell. 

25. A methylation profile of the genome of a eukaryotic cell or group of cells, the
methylation profile comprising the presence or absence of methylation at particular sites
and/or junctions between methylated and unmethylated regions, said method comprising 
obtaining a sample of genomic DNA from said cell or group of cells, digesting a sub- 
sample of said sample of genomic DNA with a methylation-sensitive enzyme having a 
recognition nucleotide sequence comprising a nucleotide capable of methylation at one or 
more of the above-mentioned sites, subjecting said digested DNA to an amplification 
reaction using primers comprising a nucleotide sequence capable of annealing to a non- 
cleaved form of a cleavable nucleotide sequence of the methylation-sensitive enzyme 
recognition sequence and subjecting the products of the amplification reaction to 
separation or other detection means relative to a control but subjected to an amplification 
reaction using the same primers as for the digested DNA sample and then subjecting the 
products of the amplification reaction to the separation or other detection means, said 
control comprising another sub-sample of said sample of genomic DNA not subjected to 
digestion by the methylation-sensitive enzyme wherein the presence of amplification 
products in enzyme digested and non-digested samples is indicative of a methylation- 
sensitive enzyme-digestion-resistant marker, the absence and presence of amplification 
products in enzyme digested and undigested samples, respectively, is indicative of a 
methylation-sensitive enzyme-digestion-sensitive marker and the presence and absence of 
amplification products in enzyme digested and undigested samples, respectively, is 
indicative of a methylation-sensitive enzyme-digestion-dependent marker.

26. The method of Claim 25 wherein the junctions of methylated and unmethylated
DNA are detected.

27. The method of Claim 25 or 26 wherein the potentially-methylated sites are
CpG or CpNpG nucleotides within the recognition sequence of the methylation-sensitive
enzyme.

28. The method of Claim 25 or 26 or 27 wherein the methylation-sensitive enzyme
is selected from the list comprising AatII, AciI, AclI, AgeI, AscI, AvaI, BamHI, BsaAI,
BsaHI, BsiE, BsiW, BsrF, BssHII, BstBI, BstUI, ClaI, EagI, HaeII, HgaI, HhaI, HinPI,
HpaII, MloI, MspI, NaeI, NarI, NotI, NruI and PmlI.

29. The method of Claim 28 wherein the methylation-sensitive enzyme is HpaII or
its functional equivalent.

30. The method of Claim 29 wherein the methylation-sensitive enzyme is HpaII.

31. The method of Claim 25 wherein the amplification reaction is selected from
polymerase chain reaction (PCR), ligase chain reaction (LCR), gap filling LCR (GLCR),
Qβ replicase, strand displacement amplification (SDA), self-sustained sequence replication
(3SR) and nucleic acid sequence-based amplification (NASBA).

32. The method of Claim 31 wherein the amplification reaction is PCR.

33. The method of Claim 32 wherein the amplification reaction is real-time PCR. 

34. The method of Claim 25 wherein the separation means is gel electrophoresis. 

35. The method of Claim 25 wherein the eukaryotic cell is a human or mammalian
embryonic stem (ES) cell, embryonic germ (EG) cell, post-natal stem cell, committed cell
or mature cell or a cell having a state of development in between any of the above cells.

36. The method of Claim 25 wherein the eukaryotic cell is a cancer cell or cancer
cell line.

37. The method of Claim 25 wherein the cell is a plant cell.

38. The method of Claim 25 wherein the cell is a non-mammalian animal cell. 

39. The method of Claim 21 wherein the cell is a mammalian cell. 

40. An assay device in the form of a kit useful for determining the methylation
profile of one or more nucleotides at one or more sites within the genome of a eukaryotic
cell, said kit comprising in compartmental form multiple compartments each adapted to
comprise one or more of buffers, diluents and enzymes as single or multiple components
which are optionally required to be admixed prior to use, said kit further comprising
instructions for use, wherein the method is conducted by obtaining a sample of genomic 
DNA from said cell, digesting a sub-sample of said sample of genomic DNA with a 
methylation-sensitive enzyme having a recognition nucleotide sequence comprising a 
nucleotide capable of being methylated at one or more of the above-mentioned sites, 
subjecting said digested DNA to an amplification reaction using primers comprising a 
nucleotide sequence capable of annealing to a non-cleaved form of a cleavable nucleotide 
sequence of the methylation-sensitive enzyme recognition sequence and subjecting the 
products of the amplification reaction to separation or detection means relative to a control 
but subjected to an amplification reaction using the same primers as for the digested DNA 
sample and then subjecting the products of the amplification reaction to the separation or 
other detection means, said control comprising another sub-sample of said sample of 
genomic DNA not subjected to digestion by the methylation-sensitive enzyme, wherein the 
presence of amplification products in enzyme digested and non-digested samples is 
indicative of a methylation-sensitive enzyme-digestion-resistant marker, and wherein the 
absence and presence of amplification products in enzyme digested and undigested 
samples, respectively, is indicative of a methylation-sensitive enzyme-digestion-sensitive
marker and wherein the presence and absence of amplification products in enzyme
digested and undigested samples, respectively, is indicative of a methylation-sensitive
enzyme-digestion-dependent marker.

41. The kit of Claim 40 wherein the junctions of methylated and unmethylated
DNA are detected.

42. The kit of Claim 40 wherein the potentially-methylated sites are CpG
or CpNpG nucleotides within the recognition sequence of the methylation-sensitive
enzyme.

43. The kit of Claim 40 or 41 or 42 wherein the methylation-sensitive enzyme is
selected from the list comprising AatII, AciI, AclI, AgeI, AscI, AvaI, BamHI, BsaAI,
BsaHI, BsiE, BsiW, BsrF, BssHII, BstBI, BstUI, ClaI, EagI, HaeII, HgaI, HhaI, HinPI,
HpaII, MloI, MspI, NaeI, NarI, NotI, NruI and PmlI.

44. The kit of Claim 43 wherein the methylation-sensitive enzyme is HpaII or its
functional equivalent.

45. The kit of Claim 44 wherein the methylation-sensitive enzyme is HpaII.

46. The kit of Claim 40 wherein the amplification reaction is selected from
polymerase chain reaction (PCR), ligase chain reaction (LCR), gap filling LCR (GLCR),
Qβ replicase, strand displacement amplification (SDA), self-sustained sequence replication
(3SR) and nucleic acid sequence-based amplification (NASBA).

47. The kit of Claim 46 wherein the amplification reaction is PCR. 

48. The kit of Claim 47 wherein the amplification reaction is real-time PCR. 

49. The kit of Claim 40 wherein the separation means is gel electrophoresis.

50. The kit of Claim 40 wherein the eukaryotic cell is a human or mammalian
embryonic stem (ES) cell, embryonic germ (EG) cell, post-natal stem cell, committed cell
or mature cell or a cell having a state of development in between any of the above cells.

51. The kit of Claim 40 wherein the eukaryotic cell is a cancer cell or cancer cell 
line. 

52. The kit of Claim 40 wherein the cell is a plant cell. 

53. The kit of Claim 40 wherein the cell is a non-mammalian animal cell. 

54. The kit of Claim 40 which detects the junctions of methylated and 
unmethylated DNA in cells. 

55. A method for detecting a change in a cell's or group of cells' developmental 
state or a cell's or group of cells' exposure to an internal or external stimulus, said method 
comprising detecting a change in methylation profile over time by the method comprising 
obtaining a sample of genomic DNA from said cell, digesting a sub-sample of said sample 
of genomic DNA with a methylation-sensitive enzyme having a recognition nucleotide 
sequence comprising a nucleotide capable of being methylated at one or more of the 
above-mentioned sites, subjecting said digested DNA to an amplification reaction using
primers comprising a nucleotide sequence capable of annealing to a non-cleaved form of a 
cleavable nucleotide sequence of the methylation-sensitive enzyme recognition sequence 
and subjecting the products of the amplification reaction to separation or other detection 
means relative to a control but subjected to an amplification reaction using the same 
primers as for the digested DNA sample and then subjecting the products of the 
amplification reaction to the separation or other detection means, said control comprising 
another sub-sample of said sample of genomic DNA not subjected to digestion by the 
methylation-sensitive enzyme, wherein the presence of amplification products in enzyme 
digested and non-digested samples is indicative of a methylation-sensitive
enzyme-digestion-resistant marker, and wherein the absence and presence of amplification products
in enzyme digested and undigested samples, respectively, is indicative of a methylation- 
sensitive enzyme-digestion-sensitive marker and wherein the presence and absence of 
amplification products in enzyme digested and undigested samples, respectively, is 
indicative of a methylation-sensitive enzyme-digestion-dependent marker. 

56. The method of Claim 55 wherein the junctions of methylated and unmethylated 
DNA are detected. 

57. The method of Claim 55 wherein the potentially-methylated sites are CpG or 
CpNpG nucleotides within the recognition sequence of the methylation-sensitive enzyme. 

58. The method of Claim 55 or 56 or 57 wherein the methylation-sensitive enzyme
is selected from the list comprising AatII, AciI, AclI, AgeI, AscI, AvaI, BamHI, BsaAI,
BsaHI, BsiE, BsiW, BsrF, BssHII, BstBI, BstUI, ClaI, EagI, HaeII, HgaI, HhaI, HinPI,
HpaII, MloI, MspI, NaeI, NarI, NotI, NruI and PmlI.

59. The method of Claim 58 wherein the methylation-sensitive enzyme is HpaII or
its functional equivalent.

60. The method of Claim 59 wherein the methylation-sensitive enzyme is HpaII.

61. The method of Claim 55 wherein the amplification reaction is selected from
polymerase chain reaction (PCR), ligase chain reaction (LCR), gap filling LCR (GLCR),
Qβ replicase, strand displacement amplification (SDA), self-sustained sequence replication
(3SR) and nucleic acid sequence-based amplification (NASBA).

62. The method of Claim 61 wherein the amplification reaction is PCR.

63. The method of Claim 62 wherein the amplification reaction is real-time PCR.

64. The method of Claim 61 wherein the separation means is gel electrophoresis.

65. The method of Claim 61 wherein the eukaryotic cell is a human or mammalian
embryonic stem (ES) cell, embryonic germ (EG) cell, post-natal stem cell, committed cell 
or mature cell or a cell having a state of development in between any of the above cells. 

66. The method of Claim 61 wherein the eukaryotic cell is a cancer cell or cancer 
cell line. 

67. The method of Claim 16 wherein the cell is a plant cell. 

68. The method of Claim 61 wherein the cell is a non-mammalian animal cell. 

69. A computer program product for assessing the likely phenotype of a cell based
on methylome profile, said product comprising:-

(1) code that receives an input value for one or more of features wherein
said features are selected from:-

(a) absence or presence of AMP marker Class I;

(b) absence or presence of AMP marker Class II;

(c) absence or presence of AMP marker Class III;

(d) absence or presence of phenotype mapped to a Class I marker;

(e) absence or presence of phenotype mapped to a Class II marker;

(f) absence or presence of phenotype mapped to a Class III marker;

(g) absence or presence of a junction between methylated and
unmethylated DNA; and

(h) absence or presence of phenotype mapped to a junction between
methylated and unmethylated DNA;

(2) a computer readable medium that stores the code.
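
By way of illustration only, code that receives input values for one or more of features (a) to (h) may be sketched as a simple record type. The field names and the `MethylomeFeatures` class are hypothetical; the claim does not prescribe any particular data structure, and no phenotype prediction logic is implied by the sketch.

```python
from dataclasses import dataclass, fields
from typing import Optional


@dataclass
class MethylomeFeatures:
    """Presence (True), absence (False) or not supplied (None) for each feature."""
    class_i_marker: Optional[bool] = None          # feature (a)
    class_ii_marker: Optional[bool] = None         # feature (b)
    class_iii_marker: Optional[bool] = None        # feature (c)
    phenotype_on_class_i: Optional[bool] = None    # feature (d)
    phenotype_on_class_ii: Optional[bool] = None   # feature (e)
    phenotype_on_class_iii: Optional[bool] = None  # feature (f)
    junction: Optional[bool] = None                # feature (g)
    phenotype_on_junction: Optional[bool] = None   # feature (h)

    def __post_init__(self) -> None:
        # The claim recites "one or more" features, so reject an empty record.
        if not self.supplied():
            raise ValueError("at least one feature value must be supplied")

    def supplied(self) -> dict[str, bool]:
        """Return only the features for which an input value was received."""
        return {
            f.name: getattr(self, f.name)
            for f in fields(self)
            if getattr(self, f.name) is not None
        }


record = MethylomeFeatures(class_iii_marker=True, junction=False)
print(record.supplied())
```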
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70. A computer system for assessing the likely phenotype of a cell based on 

methylation profile wherein said computer system comprises:- 

(1) a machine-readable data storage medium comprising a data storage 
material encoded with machine-readable data, wherein said machine- 
readable data comprise values for one or more features, wherein said 
features are selected from:- 

(a) absence or presence of AMP marker Class I; 

(b) absence or presence of AMP marker Class II;

(c) absence or presence of AMP marker Class III; 

(d) absence or presence of phenotype mapped to a Class I marker; 

(e) absence or presence of phenotype mapped to a Class II marker; 

(f) absence or presence of phenotype mapped to a Class III marker;

(g) absence or presence of a junction between methylated and 
unmethylated DNA; and 

(h) absence or presence of phenotype mapped to a junction between 
methylated and unmethylated DNA; 

(2) a working memory for storing instructions for processing said machine- 
readable data; 

(3) a central-processing unit coupled to said working memory and to said 
machine-readable storage medium, for processing said machine- 
readable data to provide comparison of phenotype and AMP marker 
classes; and 

(4) an output hardware coupled to said central processing unit for receiving
said data of comparison.

71. An isolated stem cell or stem cell line or culture of stem cells characterized as
having a methylation profile as determined by the method of Claim 1 or 16.

72. A method for identifying a genetic sequence associated with a disease 
phenotype or unwanted trait, said method comprising determining the methylation profile
within the genomes of normal compared to diseased tissue by obtaining a sample of 
genomic DNA from said cell or group of cells, digesting a sub-sample of said sample of 
genomic DNA with a methylation-sensitive enzyme having a recognition nucleotide 
sequence comprising a nucleotide sequence capable of methylation at one or more of the 
above-mentioned sites, subjecting said digested DNA to an amplification reaction using 
primers comprising a nucleotide sequence capable of annealing to a non-cleaved form of a 
cleavable nucleotide sequence of the methylation-sensitive enzyme recognition sequence 
and subjecting the products of the amplification to separation or other detection means 
relative to a control, said control comprising another sub-sample of said sample of 
genomic DNA not subjected to digestion by the methylation-sensitive enzyme but 
subjected to an amplification reaction using the same primers as for the digested DNA 
sample and then subjecting the products of this amplification reaction to said separation or 
other detection means, wherein the presence of the same amplification products in enzyme 
digested and non-digested samples is indicative of a methylation-sensitive 
enzyme-digestion-resistant marker, the absence and presence of amplification products in enzyme 
digested and undigested samples, respectively, is indicative of a methylation-sensitive 
enzyme-digestion-sensitive marker, and the presence and absence of amplification 
products in enzyme digested and undigested samples, respectively, is indicative of a 
methylation-sensitive enzyme-digestion-dependent marker, and wherein the presence of 
particular markers is indicative of normal or diseased tissue. 
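
The three marker classes recited in Claim 72 follow from whether an amplification product appears in the digested sub-sample, the undigested control, or both. The following is a minimal sketch of that decision rule only, not part of the claimed method; the function names and the representation of products as sets of band sizes are assumptions made for illustration.

```python
from typing import Dict, Optional, Set


def classify_marker(in_digested: bool, in_undigested: bool) -> Optional[str]:
    """Classify one amplification product by its presence in each sample.

    Hypothetical helper: the labels mirror the marker classes of Claim 72.
    """
    if in_digested and in_undigested:
        # Same product in both samples.
        return "digestion-resistant"
    if in_undigested and not in_digested:
        # Product lost after digestion.
        return "digestion-sensitive"
    if in_digested and not in_undigested:
        # Product appears only after digestion.
        return "digestion-dependent"
    # Absent from both samples: nothing to classify.
    return None


def classify_bands(digested: Set[int], undigested: Set[int]) -> Dict[int, Optional[str]]:
    """Classify every band seen in either sample (bands given as sizes in bp)."""
    return {
        band: classify_marker(band in digested, band in undigested)
        for band in sorted(digested | undigested)
    }
```

For example, calling classify_bands with the digested and undigested band sets of one primer gives one class per band, which could then be compared between normal and diseased tissue.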

73. The method of Claim 72 wherein the potentially-methylated sites are CpG or 
CpNpG nucleotides within the recognition sequence of the methylation-sensitive enzyme. 

74. The method of Claim 72 or 73 wherein the methylation-sensitive enzyme is 
selected from the list comprising AatII, AciI, AclI, AgeI, AscI, AvaI, BamHI, BsaAI, 
BsaHI, BsiE, BsiW, BsrF, BssHII, BstBI, BstUI, ClaI, EagI, HaeII, HgaI, HhaI, HinP1, 
HpaII, MloI, MspI, NaeI, NarI, NotI, NruI and PmlI. 

75. The method of Claim 74 wherein the methylation-sensitive enzyme is HpaII or 
its functional equivalent. 

76. The method of Claim 75 wherein the methylation-sensitive enzyme is HpaII. 
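
HpaII recognizes the tetranucleotide 5'-CCGG-3', and each primer in the sequence listing below appears to contain that site. The following sketch shows one way such a site might be located in a primer or marker sequence; the function name find_sites and the default site argument are illustrative assumptions, not part of the claimed method.

```python
import re
from typing import List


def find_sites(sequence: str, site: str = "ccgg") -> List[int]:
    """Return 0-based start positions of every occurrence of `site`.

    A zero-width lookahead is used so that overlapping occurrences
    are also reported. Matching is case-insensitive.
    """
    pattern = re.compile(f"(?={re.escape(site.lower())})")
    return [m.start() for m in pattern.finditer(sequence.lower())]


# SEQ ID NO:1 from the sequence listing.
primer_1 = "tttgcccgga"
positions = find_sites(primer_1)
```

An empty list would mean the sequence carries no recognition site for the enzyme under the stated assumption.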

77. The method of Claim 72 wherein the amplification reaction is selected from 
polymerase chain reaction (PCR), ligase chain reaction (LCR), gap filling LCR (GLCR), 
Qβ replicase, strand displacement amplification (SDA), self-sustained sequence replication 
(3SR) and nucleic acid sequence-based amplification (NASBA). 

78. The method of Claim 77 wherein the amplification reaction is PCR. 

79. The method of Claim 78 wherein the amplification reaction is real-time PCR. 

80. A method of treating a subject having a disease state associated with silencing 
of a genetic sequence as determined by the method of Claim 72, said method comprising 
introducing into said subject said genetic sequence flanked by methylation-preventing 
sequences to thereby replace the silenced genetic sequences or introducing a protein or 
non-proteinaceous product which replaces the protein encoded by the silenced genetic 
sequence. 

81. The method of Claim 80 wherein the potentially-methylated sites are CpG or 
CpNpG nucleotides within the recognition sequence of the methylation-sensitive enzyme. 

82. The method of Claim 80 or 81 wherein the methylation-sensitive enzyme is 
selected from the list comprising AatII, AciI, AclI, AgeI, AscI, AvaI, BamHI, BsaAI, 
BsaHI, BsiE, BsiW, BsrF, BssHII, BstBI, BstUI, ClaI, EagI, HaeII, HgaI, HhaI, HinP1, 
HpaII, MloI, MspI, NaeI, NarI, NotI, NruI and PmlI. 

83. The method of Claim 82 wherein the methylation-sensitive enzyme is HpaII or 
its functional equivalent. 

84. The method of Claim 83 wherein the methylation-sensitive enzyme is HpaII. 

85. The method of Claim 80 wherein the amplification reaction is selected from 
polymerase chain reaction (PCR), ligase chain reaction (LCR), gap filling LCR (GLCR), 
Qβ replicase, strand displacement amplification (SDA), self-sustained sequence replication 
(3SR) and nucleic acid sequence-based amplification (NASBA). 

86. The method of Claim 85 wherein the amplification reaction is PCR. 

87. The method of Claim 86 wherein the amplification reaction is real-time PCR. 
SEQUENCE LISTING 

<110> The University of Queensland 
Carroll, Bernard B (US only) 
Harrison, Dion K (US only) 
Aung, Hnin T (US only) 

<120> A method 

<130> 2564280/EJH 

<140> International 
<141> 2002-09-13 

<150> AU PR7685 
<151> 2001-09-14 

<160> 38 

<170> PatentIn version 3.1 

<210> 1 

<211> 10 

<212> DNA 

<213> primer 

<400> 1 

tttgcccgga 10 

<210> 2 

<211> 10 

<212> DNA 

<213> primer 

<400> 2 

tggaccggtg 10 

<210> 3 

<211> 10 

<212> DNA 

<213> primer 

<400> 3 

acccggtcac 10 

<210> 4 

<211> 10 

<212> DNA 

<213> primer 

<400> 4 

aacccgggaa 10 
<210> 5 

<211> 10 

<212> DNA 

<213> primer 

<400> 5 
ttcccgggtt 



<210> 6 

<211> 10 

<212> DNA 

<213> primer 

<400> 6 
tttgcccggt 



<210> 7 

<211> 10 

<212> DNA 

<213> primer 

<400> 7 
cccggcataa 



<210> 8 

<211> 10 

<212> DNA 

<213> primer 

<400> 8 
cacccggatg 

<210> 9 

<211> 10 

<212> DNA 

<213> primer 

<400> 9 
tcagtccggg 



<210> 10 

<211> 10 

<212> DNA 

<213> primer 

<400> 10 
tgccggcttg 



<210> 11 
<211> 10 
<212> DNA 
<213> primer 

<400> 11 
ccccggtaac 



<210> 12 

<211> 10 

<212> DNA 

<213> primer 

<400> 12 
cagtgccggt 

<210> 13 

<211> 10 

<212> DNA 

<213> primer 

<400> 13 
accggcttgt 



<210> 14 

<211> 10 

<212> DNA 

<213> primer 

<400> 14 
gtccggagtg 

<210> 15 

<211> 10 

<212> DNA 

<213> primer 

<400> 15 
acaccggaac 

<210> 16 

<211> 10 

<212> DNA 

<213> primer 

<400> 16 
cccggatggt 



<210> 17 

<211> 10 

<212> DNA 

<213> primer 

<400> 17 
aagaccggga 10 



<210> 18 

<211> 10 

<212> DNA 

<213> primer 

<400> 18 

tcccggtgag 10 



<210> 19 

<211> 10 

<212> DNA 

<213> primer 

<400> 19 

gaatccggca 10 

<210> 20 

<211> 10 

<212> DNA 

<213> primer 

<400> 20 

acccggaaac 10 

<210> 21 

<211> 10 

<212> DNA 

<213> primer 

<400> 21 

tgccggttca 10 

<210> 22 

<211> 10 

<212> DNA 

<213> primer 

<400> 22 

agccgggtaa 10 

<210> 23 

<211> 10 

<212> DNA 

<213> primer 

<400> 23 

cccggataca 10 

<210> 24 

<211> 10 

<212> DNA 

<213> primer 

<400> 24 
cccggaagag 



<210> 25 

<211> 10 

<212> DNA 

<213> primer 

<400> 25 
ctaccggcac 



<210> 26 

<211> 10 

<212> DNA 

<213> primer 

<400> 26 
acctccggtc 



<210> 27 

<211> 10 

<212> DNA 

<213> primer 

<400> 27 
ctccggatca 



<210> 28 

<211> 10 

<212> DNA 

<213> primer 

<400> 28 
tttccgggag 



<210> 29 

<211> 10 

<212> DNA 

<213> primer 

<400> 29 
aggccggtca 



<210> 30 
<211> 10 
<212> DNA 
<213> primer 

<400> 30 
caaccggtct 10 



<210> 31 

<211> 10 

<212> DNA 

<213> primer 

<400> 31 

ccgccggtaa 10 



<210> 32 

<211> 10 

<212> DNA 

<213> primer 

<400> 32 

tccgggactc 10 

<210> 33 
<211> 396 
<212> DNA 
<213> markers 

<400> 33 

tttcccgggt tgcctatcgc gtacttggca cacagttaag tgctcgagaa attgctaaac 60 

actcaacgaa tatttaccag aatacatatt tatgctccac cgttgctgaa agaatttcga 120 

ctaacccttc tttaagtctc tcttaccctc ctgtactatt aacaccattg taagtaggca 180 

gtggagctac atttgcaggg agagagaaga gcgaaaaatt ggcgcgcaga gctgtaaagc 240 

gtttttctcc aggtcacacc gcagcctgta aatttgtgac acttggaccg attatgggac 300 

agcttgcaaa caaattatct gcaaactaac gaagcctatt tttaaagatc cctgttcctc 360 

gcagctctag gcgatgacat cagagaaccc gggaaa 396 

<210> 34 

<211> 259 

<212> DNA 

<213> markers 

<400> 34 

taacccggga aataaattga gaggtttttt aggcttactg catgttggct attctgctcc 60 

atagagggag gttcctgttt ccttgctttg cctcccctag tcacgccaga taaacatttt 120 

ccaaaagcaa tttgacgcgc taaatttaag tatctcccaa gacacagggt tcctaagtaa 180 

cactgagccc ttgcagcaga aacccagtag ggtccatggg cattgcatgt ttaaaggtgt 240 

tcttgggatt cccgggtta 259 

<210> 35 

<211> 299 

<212> DNA 

<213> markers 

<400> 35 

ttggaccggt gtccagtcaa gggaataagg aggtgcattc gatgactgga gttagggcac 60 

tttaaatgca ggactgaaaa tcagaaagga cgtgttctcc cccgagaaag cagagagagc 120 

agagcaaagc gagaagaatt ggggatgctt gcctcatcca ggttataaaa aaaaaaaaaa 180 

ggcctccagc cttgggtgat ggcaatgttg tttgtgctgt tacaatgcag cagattttac 240 

attttgccct ctttgcaaac ctattagcgg acactgctga agttcccaca ccggtccaa 299 

<210> 36 

<211> 103 

<212> DNA 

<213> markers 

<400> 36 

tcacccggat gacatgtctg ttgacaattt taccaggtaa ttaaggaggt aatttcctac 60 
acggtatagg tgaattgtcc agtctttaaa cacatccggg tga 103 

<210> 37 
<211> 1750 
<212> DNA 

<213> rpA2 promoter element 
<400> 37 

caggcgcccg ccaccacacc cagctaattt tttatatttt tagtagagat ggggtttcac 60 

cgtgttagcc agcatggtct tgatctcctg atctcgtgat cagggaaatc tgtttaatta 120 

aaacacgctt tgcgatttct gcccagtgct ctgaatgtca aagtcaagag taggaggaag 180 

aagaaagaag aaagcagggt gcagtggctg acgcctgtaa tcccaacact ttgggaggcc 240 

aaggtgggtg gatcacttga ggtcaggagt tcgagaccag cctgaccagc agggcgaaac 300 

ccccgtctct actaaaaata caaaaattag cagggcgtgg ttgcacgtac ctgtaatccc 360 

agctacgccg gaggctaagg caggaaaatc gcttgaaccc gggaggcgga ggttgcagtg 420 

aggtgagatc gcaccattgc actccagcct gggcaacaag agcaaaactc agtttccgaa 480 

aaaaaaaaaa aaaggaaaaa gaagagaaga aaaaaacaca cagtcctgga gcccaaaggc 540 

catgtctatc ttacttaacc attgaatccc agtggccagc acagggccag gaacaaagaa 600 

ggcataaaca attctttttt tttttttttt ttttttttga gtcgcagtct ctctctgcca 660 

cctaggctgg agtgcagtgg cagcatcaca gctcactgca gcctggacac cccagggcca 720 

ggtgatcctc ccaccttagc caccccagta gctgggaccc caggcacttg ccaccacacc 780 

agactaattt ttaaaaatat tttttgagag agggtctcac tatgttgtcc aggctggtct 840 

caaacttcca gcctcaagcg gtcctcctgc ctcagacccc atttgctggg tttacaggca 900 

tgagccacag cacctgctaa tttttcttaa atacataaat gaacataaaa ttctaacaat 960 

gcatgagtat tttgaggaag gaactgacaa aatgttccac tccctatggg aggcacgtta 1020 

tatgaagaat tatgaaaaat ggtcgaaatg actggagagg ccaagcctgg atgagactgg 1080 

gatggggaca ggtgcgggac gaggggcacc accctcacat ctttcacaag tctgtcatag 1140 

gcaagagggc gtaggtttct cacagcccca ctggggagaa tcggcaccat tggtggcatt 1200 

acacgaagag aatgtgacct cctatgtaaa agaacaagca actccacgcg gtgctgtgag 1260 

gctagtgctg cgagtccctg aggtgcgcaa ttcccgcacg accgtgggtg ggaaacaccg 1320 

aagccaaaac tccgctacag ccctttagat gaaggcgtcg tctgattggt gatagtttgg 1380 

cgcgaacctg agcacgccga acaaaggaag tgacggcaga agtcgcgcac ttgacgaggg 1440 

tgggatcaca cggcgctgcg tcgcggtagt attgttctga ttggttgatt tcttgcgata 1500 

ccgctctgcc agccccttgc ttccgctagt gcggagggtt ttgcccttcg taaagatggc 1560 

cgcggaggct tttggagcca actgggagcg cagtacgcgt tttctggagc atgggcagag 1620 

gagacaggaa caagcgtagc atccgtgagc accgattggc tgaagcgagc accccgggag 1680 

ctgactggct ccgccattcg cgggaaggcg tttgtggtgc cagagaaaag tagccagagc 1740 

ggcgcagtgg 1750 

<210> 38 
<211> 1510 
<212> DNA 

<213> rbp21 promoter element 
<400> 38 

ctgcagagag gatgaccctt gtcaccaccg tcatgtacga ggctgcttca ccactgcctc 60 

actgccacca gcgtctcccg ccgcgtgcaa tacaagaaga aacatcgaac ggtcatataa 120 

ggtaagaccc actaccgatt taacctatca ttcccacaat ctaatccact tatttctctt 180 

cccatgatct tatcctctca tttctcctca ctacttttgc atttgtagga aacacaatga 240 
caccgtcgaa gaaagctggt ggagcaccgt agccagcaat [illegible] cagaggggag 300 

aaaafccaaca gcggccatgc ggacggcgat gagacaacgc [illegible] gggaggagga 360 

cafctaacaat catgctggtg ttggcggagg aggtcactgg [illegible] gacagcgggg 420 

cacjcacaaca caaaaagggg ggaggatgcc ggcgaccacg [illegible] gaagcaagat 480 

[illegible] ggaggaccgg acgagggttg gacctctgcc gccgacgtga agagcgtgat 540 

[illegible] agatgttaga 